Five AI Projects: Every Data Student Should Ship Before Graduating

The projects on your CV are the same projects on everyone else's CV. Same RAG demo on the same PDFs. Same Kaggle classifier. Same Streamlit dashboard. Same Titanic dataset, same MNIST, same fine-tuning notebook copy-pasted from a Medium article. Your GitHub looks like everyone else's GitHub. Your "final year project" looks like the one three years before yours, and the one three years after.

Recruiters see two hundred CVs a week that all look like yours.

Figure 01 — The sameness problem

Two hundred CVs a week. The recruiter's mental image looks like this.

CVs per week, per recruiter, in a single posting

Click "If they shipped what's in this article" — one card changes. That is the entire premise of this essay.

The market changed. The bar moved. The projects that got someone hired in 2021 won't get you a callback in 2026. This essay is about what to ship instead — five projects that signal you've done the work nobody around you bothered to do.

Most students stop at the first phase of a project — sometimes from inexperience, sometimes from laziness — right when the problems start to become interesting. Some teachers and alumni will tell you that's fine, that juniors should focus on junior-level tasks and let the senior work come with time. We disagree, and the labour data does too.

Figure 02 — What changed under your feet

Entry-level software roles have not slowed. They have collapsed.

−0%

Employment, ages 22–25 in software dev, vs. its late-2022 peak.

−0%

Entry-level job postings over the same window.

Source: Yale School of Management, "The real job destruction from AI is hitting before careers can start" — composite chart drawn from reported figures. Read the analysis →

What was once a junior job is now done by AI. What's left for juniors? To say "yes" repeatedly to a model until it finishes the job. Nonsense. Now is the time for juniors to step into what was once considered beyond their level.

A portfolio speaks before you do — recruiters and clients read artifacts, not transcripts. Projects are where you develop taste, and knowing what to build is harder than knowing how to build it. They teach the full lifecycle — shipping, maintaining, deprecating — not just the fun middle part everyone gravitates to.

Treat projects as the artifact. Everything else is a footnote.

Antipatterns

Projects that look like work but won't get you hired.

These are the projects every recruiter has already seen this week. They're not wrong to build — they're wrong to put at the top of a CV. Cross them off; what remains is the brief for the rest of this essay.

school projectscandidates from the same school have the same CV

course learning projectsthose exist to teach the basics, not prove competence

III

plug-and-play projectswiring tools together is plumbing, not engineering

clone-of-X projects"I built a ChatGPT clone" reads as "I followed a YouTube video"

benchmark-chasing projects0.3% on a leaderboard is research, not engineering

demo-only projectsit works on your laptop, it dies on anyone else's

VII

projects without usersif no one but you ever opened it, it's a learning exercise

The standard

What good looks like.

Before the specifics, here is the bar every project below should meet. Score the project you would put on your CV today. Be ruthless — recruiters are.

Figure 03 — Score your own project

The bar every project should meet.

Tap the rows that are true of the project you'd put on your CV today.

0/ 9

solves a real problemnot "a problem I invented to justify the project"

could be used by someone elselevel 1: another developer · level 2: a non-technical person

deployed and used by othersliving somewhere other than your laptop

documented at multiple levels of technicitya recruiter, a teammate, and a senior engineer all leave understanding it

has an opinionyou made non-obvious choices and can defend them

has a failure mode you handledrate limits, bad inputs, downtime, hallucinations

has at least one metric you can quotelatency, accuracy, retention, cost per request

explainable in 60s to a non-engineer, 30m to a seniorand you have actually done both

you'd put your name on it publiclythis is the load-bearing question

Pick the project you'd show off. Score it honestly.

The five projects

The version every student ships — and the version that gets you on a call.

For each project, toggle between the MVP everyone builds and the shipped version that earns an interview. Same five problems. Two very different artifacts.

What must be thereHow to go beyond

Chunking. and a defended reason for the size — not "I picked 512"

Embedding. text → vectors

Vector database. kNN search

Retrieval step. one that you can actually evaluate, not "looks good"

Grounded prompt. refuses to hallucinate when nothing matches

"No context" handling. most student RAGs confidently make things up here

Eval set · 20–50 Qs. with expected answers, so you know if changes help or hurt

Reranking. cross-encoder or LLM-based, on top of vector search

Hybrid search. semantic + BM25 — proper nouns, IDs, codes survive

Real UI. Next.js, Streamlit, or a Slack bot — somewhere non-technical users reach it

Multiple sources. PDFs + web + database + APIs

Query rewriting / decomposition. multi-hop questions actually work

Citations. every answer points back to its source chunk, clickable

Conversation memory. follow-ups like "what about the second one?" resolve

Structured extraction. tables, dates, amounts — not raw text dumps

Feedback loop. thumbs up/down, logged for later analysis

Cost & latency tracking. you can quote $/query and ms/query

The one decision that signals you understood the problem You can explain, in one sentence, why retrieval fails on your hardest five queries — and what you'd do about it.

Examples worth shipping

01Your country's legal code, central bank reports, or academic papers in your field

02Your own Obsidian vault and lecture PDFs — bonus: you become its first real user

03A profession's documents: doctors' guidelines, tax law, building codes — niche beats general

04A podcast or YouTube channel's transcripts — multimodal-adjacent, easy to demo

05A company or internship's public documentation — turns into a real interview conversation

06A GitHub repository: code + issues + PRs — useful for explaining unfamiliar codebases

What must be thereHow to go beyond

Named user · named task. "Salma spends 2h every Friday consolidating 3 emails into one sheet"

Measurable "before". how long it takes manually, how often, how error-prone

End-to-end automation. no manual steps in the middle

Error handling. what happens when the email doesn't arrive, the format changes, the API returns 500

Logs and observability. the user knows it ran, what it did, what it skipped

Kill switch. the user can turn it off without calling you

Real schedule. cron, GitHub Actions, a small server — not "when I remember"

Notification layer. email, Slack, WhatsApp — the user trusts it without checking

Partial-failure handling. if 8/10 processed, the user knows which 2 failed and why

Configurable without code. a YAML file, a Google Sheet, a small admin UI

Audit trail. every run logged, every action reversible where possible

Versioning. when you change the logic, old runs are still explainable

Real hand-off. walk away for two weeks, see if it still runs

The one decision that signals you understood the problem You can quote the time it saves per week, and the user is still using it three months later.

Examples worth shipping

01An internship task you actually did by hand for two months — you know the edge cases because you lived them

02A family member's small business: a shop, a clinic, a freelance practice

03A student club's process: event registrations, member comms, treasury

04Your own job search: tracking applications, follow-ups, recruiter responses across platforms

05A researcher's grunt work in your department — papers to organize, citations to format, datasets to clean

What must be thereHow to go beyond

Real, updating dataset. not a static CSV downloaded once

Clear primary user. and the decision they're trying to make when they open it

3–5 metrics. that map to that decision — not 20 charts because they looked nice

Real filtering and drill-down. time comparisons (WoW, YoY) that actually work

Loads in under 3 seconds. non-technical users close slow tabs

Works on mobile. because that's where half your users will open it

Natural-language queries. "show me last month's top 5 clients by revenue" — no SQL

Automatic anomaly detection. the dashboard tells the user what's unusual

Scheduled summaries. Monday-morning email with the 3 things that changed last week

Role-based views. a salesperson sees their pipeline; a manager sees the team's

Export to real formats. PDF for management, Excel for finance, image for WhatsApp

"Why did this change?" walks back the cause

Comments and annotations. users leave notes on data points for the next person

The one decision that signals you understood the problem A non-technical person used it for ten minutes without help, and came back the next day on their own.

Examples worth shipping

01A small business currently running on WhatsApp screenshots and Excel files

02A sports club, NGO, or association tracking what they actually care about

03A personal-finance dashboard from your real bank statements — you'll feel bad UX immediately

04A public-data dashboard your city doesn't have but should — air quality, transit reliability

05A researcher in your faculty tracking their own KPIs — papers, citations, grants

What must be thereHow to go beyond

One profession · one sub-task. "help notaries draft preliminary sale agreements" — not "help lawyers with law"

Domain-grounded outputs. every claim or clause traceable to a source the professional trusts

A workflow, not just a chat. input → structured steps → reviewable output

Export to their format. Word with their letterhead — not Markdown

Field-appropriate guardrails. medical, legal, financial outputs need stronger refusals than marketing

Used on real cases. by at least one real professional — not a demo audience

Domain-specific eval set. built with the professional, covering the cases that matter to them

Visible multi-step reasoning. they need to verify, not just consume

Integration with their stack. DocuSign, practice management, accounting platforms

Knowledge base of their own work. the assistant learns from prior cases

Field-appropriate compliance. HIPAA-adjacent for medical, attorney-client privilege for legal

Pricing & packaging. work out the cost and who pays, even if you don't sell yet

"Not a replacement for judgment". enforced in the product, not just the footer

The one decision that signals you understood the problem A working professional would pay for it, and you can name the price and the budget line it comes out of.

Examples worth shipping

01A contract review assistant for small-firm lawyers in a specific jurisdiction

02A clinical-note structurer for GPs — voice in, structured note out

03A tax-filing assistant for freelancers in a specific country and tax regime

04A tender-response assistant for small consulting firms bidding on government contracts

05A lesson-plan assistant for teachers in a specific grade and subject

06A real-estate listing assistant — photo in, listing copy + comparables out

What must be thereHow to go beyond

One narrow, named task. clear input, clear output

Multiple tools in sequence. not a single LLM call dressed up as an agent

Clear stopping condition. knows when it's done, doesn't loop forever, doesn't burn tokens

Human-in-the-loop at the right step. review before sending — not after damage is done

Better than "the agent crashed". partial outputs, retries, escalation

Cost per run, tracked. $4/run isn't a product, it's a bug

Real run, real work, accepted. without the human rewriting it

Cross-run memory. remembers context, preferences, prior decisions per user

Visible plan before execution. agents that announce their plan are easier to trust

Self-correction. detects broken link, malformed file, wrong recipient — fixes before delivering

Evals you run on every change. agents regress silently — catch it

Multi-agent only when it earns its keep. most multi-agent systems are one agent that should have stayed one

Clean hand-off format. the output is the input to the next person's workflow

Observability. every tool call, decision, and token logged and inspectable

The one decision that signals you understood the problem A professional would let your agent run on their real work without supervising it for the full duration.

Examples worth shipping

01Sales call → CRM entry + follow-up email + calendar invite + flagged objections, handed back in 5 min

02Inbox triage drafts for routine categories (scheduling, status, document requests) — human approves and sends

03Bug-report line → reproducible test case + draft fix as a PR

04Job description → tailored CV + cover letter + recruiter outreach, grounded in your real experience

05Onboarding a new client for a small consulting firm: contract, kickoff doc, shared folder, first-week schedule

06PR reviews against this codebase's conventions — not generic linting

Closing

Pick one. Not five.

One project, shipped to the standard above, beats five half-built ones every time. Pick the one closest to a real user you have access to — the lawyer in your family, the manager at your internship, the doctor friend. Domain access is the unfair advantage students underuse.

Ship before it's ready. Polish in public. Nobody hires the person who's "still working on it" forever. Write about it. Get one real user — not your study group; one person who didn't know you before, who chose to use it, and who'd notice if it broke. That's the line between a project and a product.

The students who do this will be the juniors who don't get replaced. Everyone else is competing for the seat that AI is quietly removing.

$ git init

Open a new repo today. Name it the thing you're going to build — not the thing you've already built.

Entry-level software roles have not slowed. They have collapsed.

Projects that look like work but won't get you hired.

What good looks like.

The version every student ships — and the version that gets you on a call.

Pick one. Not five.

Get new essays in your inbox.