GitHub - Aliipou/culture-identifier: NLP-powered personality analyzer that matches your writing style to iconic French thinkers and artists using semantic embeddings

Match your writing style to iconic French philosophers, writers, and artists.

Not about what you say — about how you say it.

The Idea

Every writer has a signature: the rhythm of their sentences, the density of their vocabulary, the way they build an argument or paint an image. These stylistic fingerprints persist across topics. Camus would write about football differently than Flaubert would, even if both were being concise.

This system encodes those fingerprints as dense semantic vectors and finds whose voice yours most resembles.

How It Works

Your text
    |
    v
[Sentence Embedder]      Encodes your text into a 384-dim semantic vector
    |                    using a fine-tuned multilingual sentence-transformer
    v
[Style Profiles]         Pre-computed embeddings of each cultural figure's
    |                    representative works (essays, letters, excerpts)
    v
[Cosine Similarity]      Ranks all figures by similarity to your text
    |
    v
[Explanation Engine]     Highlights the specific stylistic features
                         that drove the match (sentence length, lexical
                         density, rhetorical patterns, emotional register)
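
The ranking step in the pipeline above can be sketched in a few lines. This is a minimal, dependency-free illustration, not the project's actual code: the function names `cosine_similarity` and `rank_figures` are hypothetical, and the 3-dimensional vectors are made-up stand-ins for real embeddings. The Tech Stack lists `scikit-learn` for this step; plain Python is used here only to keep the sketch self-contained.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen for this sketch: zero vectors match nothing
    return dot / (norm_a * norm_b)


def rank_figures(text_vector, profiles):
    """Return (name, score) pairs sorted from most to least similar."""
    scores = [
        (name, cosine_similarity(text_vector, vector))
        for name, vector in profiles.items()
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy vectors invented for illustration; real profiles would come from the embedder.
toy_profiles = {
    "Albert Camus": [0.9, 0.1, 0.2],
    "Marcel Proust": [0.1, 0.9, 0.3],
    "Voltaire": [0.4, 0.4, 0.8],
}
toy_text_vector = [0.8, 0.2, 0.3]

ranking = rank_figures(toy_text_vector, toy_profiles)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

Because cosine similarity ignores vector length, a short input and a long source text can still score highly against each other as long as their embeddings point in a similar direction.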

Cultural Figures

Philosophers	Writers	Artists (artistic philosophy)
Jean-Paul Sartre	Gustave Flaubert	Claude Monet
Albert Camus	Marcel Proust	Pablo Picasso
Simone de Beauvoir	Émile Zola	Marcel Duchamp
Voltaire	Victor Hugo	
René Descartes	Charles Baudelaire	
Blaise Pascal	Arthur Rimbaud	

Quick Start

git clone https://github.com/Aliipou/culture-identifier.git
cd culture-identifier
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
python app.py

Open http://localhost:5000 in your browser.

Example

Input: "The absurdity of existence does not negate our freedom to choose.
        On the contrary, it is precisely because nothing is predetermined
        that every choice carries its full weight."

Top Match: Albert Camus (0.91)
Reason:    Existential framing, short declarative sentences, use of
           paradox to reveal rather than obscure, direct address
           of the reader.

Runner-up: Jean-Paul Sartre (0.84)
Runner-up: Simone de Beauvoir (0.79)
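
The Reason line above names stylistic features such as sentence length. As a rough illustration of how surface features like these could be measured, here is a minimal sketch. It is an assumption about the general approach, not the contents of `analyzer/explainer.py`: the function name `style_features` is hypothetical, sentence splitting is a simple punctuation heuristic, and the type-token ratio is only a crude proxy for vocabulary richness (it is not the same measure as lexical density).

```python
import re


def style_features(text):
    """Compute a few surface-level style statistics for a passage."""
    # Split on sentence-ending punctuation; a rough heuristic, not a real tokenizer.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Words: runs of letters (accented letters included), optionally with an apostrophe.
    words = re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)?", text.lower())
    if not sentences or not words:
        return {
            "sentence_count": 0,
            "avg_sentence_length": 0.0,
            "type_token_ratio": 0.0,
        }
    return {
        "sentence_count": len(sentences),
        "avg_sentence_length": len(words) / len(sentences),  # words per sentence
        "type_token_ratio": len(set(words)) / len(words),    # distinct / total words
    }


sample = (
    "The absurdity of existence does not negate our freedom to choose. "
    "On the contrary, it is precisely because nothing is predetermined "
    "that every choice carries its full weight."
)
print(style_features(sample))
```

Features like these are interpretable in a way that raw embedding dimensions are not, which is why an explanation step might compute them separately from the similarity score.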

Project Structure

culture-identifier/
├── app.py                  Flask application and routes
├── analyzer/
│   ├── embedder.py         Sentence-transformer wrapper
│   ├── profiles.py         Pre-computed cultural figure embeddings
│   ├── similarity.py       Cosine similarity and ranking
│   └── explainer.py        Feature extraction for match explanation
├── data/
│   └── figures/            Source texts for each cultural figure
├── static/                 CSS, JS, images
├── templates/              HTML templates
├── tests/
│   ├── test_embedder.py
│   ├── test_similarity.py
│   └── test_explainer.py
└── requirements.txt

Screenshots

Main Interface

Text input with real-time character count and language hints

Analysis Results

Ranked cultural figure matches with similarity scores and style explanations

Match Detail View

Detailed breakdown of stylistic features driving the match

Tech Stack

Component	Technology
Backend	Python, Flask
NLP	`sentence-transformers` (multilingual-MiniLM-L12-v2)
Similarity	Cosine similarity via `scikit-learn`
Frontend	Vanilla JS, CSS animations
Model	Runs locally, no API key needed

Extending It

Adding a new figure

1. Add source texts to data/figures/your_figure.txt
2. Run python scripts/build_profiles.py to recompute embeddings
3. Add metadata to analyzer/profiles.py
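
To make the recompute step above more concrete, here is one plausible way a style profile could be derived from a source text: split the file into passages, embed each one, and average the vectors. This is a sketch under stated assumptions, not the contents of `scripts/build_profiles.py`. The helper names (`split_passages`, `mean_vector`, `build_profile`), the blank-line passage convention, and the toy embedding function are all hypothetical.

```python
def split_passages(raw_text):
    """Split a source file into passages separated by blank lines."""
    blocks = raw_text.replace("\r\n", "\n").split("\n\n")
    # Collapse internal whitespace so line wrapping does not affect the passage.
    return [" ".join(block.split()) for block in blocks if block.strip()]


def mean_vector(vectors):
    """Element-wise average of a non-empty list of equal-length vectors."""
    if not vectors:
        raise ValueError("need at least one vector")
    dim = len(vectors[0])
    if any(len(v) != dim for v in vectors):
        raise ValueError("all vectors must have the same length")
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def build_profile(raw_text, embed):
    """Average the embeddings of every passage in one figure's source text.

    `embed` is any callable mapping a string to a list of floats.
    """
    passages = split_passages(raw_text)
    return mean_vector([embed(passage) for passage in passages])


# Toy embedding for demonstration only: (character count, word count).
def toy_embed(passage):
    return [float(len(passage)), float(len(passage.split()))]


source = "Un deux.\n\nTrois quatre cinq."
print(build_profile(source, toy_embed))
```

Averaging is the simplest way to pool passage embeddings into a single profile vector; other pooling choices are possible and may behave differently on long or uneven source texts.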

Changing the model

The embedder is swappable. Any sentence-transformers compatible model works. Multilingual models handle French source texts better.
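
One way to keep an embedder swappable is to have the rest of the code depend only on a small interface. The sketch below is illustrative and does not describe `analyzer/embedder.py`: the `Embedder` protocol, both classes, and `embed_text` are hypothetical names. `HashingEmbedder` is a dependency-free stand-in so the example runs anywhere; `SentenceTransformerEmbedder` wraps the `sentence-transformers` library and is only imported when instantiated.

```python
import hashlib
from typing import List, Protocol


class Embedder(Protocol):
    """Anything with an encode(text) method returning a list of floats."""

    def encode(self, text: str) -> List[float]:
        ...


class HashingEmbedder:
    """Dependency-free stand-in: counts words hashed into a fixed number of buckets."""

    def __init__(self, dim: int = 16) -> None:
        self.dim = dim

    def encode(self, text: str) -> List[float]:
        vector = [0.0] * self.dim
        for word in text.lower().split():
            digest = hashlib.sha256(word.encode("utf-8")).digest()
            vector[digest[0] % self.dim] += 1.0
        return vector


class SentenceTransformerEmbedder:
    """Wrapper around a sentence-transformers model chosen by name."""

    def __init__(self, model_name: str) -> None:
        # Imported lazily so the rest of the sketch runs without the package.
        from sentence_transformers import SentenceTransformer

        self._model = SentenceTransformer(model_name)

    def encode(self, text: str) -> List[float]:
        # encode() returns a NumPy array for a single string input.
        return self._model.encode(text).tolist()


def embed_text(embedder: Embedder, text: str) -> List[float]:
    """Callers depend only on the Embedder interface, not on a concrete model."""
    return embedder.encode(text)


vector = embed_text(HashingEmbedder(dim=8), "le style le style")
print(len(vector), sum(vector))
```

Note that swapping the model changes the embedding space, so pre-computed profiles would need to be rebuilt with the same embedder before similarity scores are meaningful.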

License

MIT
