Learn Generative AI by building real systems — step by step, in Python.
This repository is a hands-on, production-oriented learning path for modern Generative AI. It takes you from your first LLM calls to full AI products spanning RAG, agents, vision, diffusion models, and deployment.
Everything here is designed to be:
- Practical
- Incremental
- Production-minded
- Easy to reason about and extend
No black boxes. You build every layer yourself.
This is not a theory repo. This is a build-first GenAI repository.
You will:
- Write real Python code
- Debug real problems (context limits, hallucinations, latency)
- Build reusable components
- Understand why systems break — and how to fix them
- Move from prompts → systems → products
Each folder represents one concrete project, focused on a single GenAI concept, implemented cleanly and explained clearly.
This repository follows four principles:
- **Hands-on first.** Every concept is learned by building something tangible.
- **Progressive complexity.** Each project builds directly on previous ones.
- **Production realism.** Topics like evaluation, guardrails, memory, orchestration, and deployment are first-class.
- **System-level thinking.** You learn how components fit together, not just how to call an API.
Projects are organized in a strict learning order, from fundamentals to advanced, real-world systems.
- What Generative AI really is
- How LLMs work internally
- Tokens, context windows, cost, latency
- Prompt structure and behavior control
- Prompt engineering patterns
- System vs user vs assistant roles
- Conversation state management
- Short-term and long-term memory
- Vector embeddings from scratch
- Semantic similarity and ranking
- Chunking strategies
- Metadata-aware retrieval
- PDF and document Q&A
- Query rewriting
- Hybrid retrieval (keyword + vector)
- Re-ranking and long-context strategies
- Multi-source knowledge systems
- Tool calling and function execution
- Planner–executor agents
- Multi-agent collaboration
- Workflow orchestration
- Autonomous agents with guardrails
- Hallucination detection
- Confidence scoring
- Prompt and model regression testing
- Evaluation pipelines
- Error handling and fallbacks
- API design
- Observability and monitoring
- Versioning and experiment tracking
- Security and access control
- Scalable architectures
- LLM fine-tuning (SFT, LoRA, PEFT)
- Domain-specific models
- Computer Vision with YOLO
- Multimodal systems
- Diffusion and Stable Diffusion
- Full-stack AI products
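Several of the fundamentals above (message roles, conversation state, context windows) fit in a few lines. The sketch below is illustrative only: the 4-characters-per-token estimate and the budget value are rough assumptions, not real tokenizer output.

```python
# Minimal sketch of conversation-state management under a context-window
# budget. Token counts are approximated; a real system would use the
# model's tokenizer.

def estimate_tokens(text: str) -> int:
    """Rough heuristic: ~4 characters per token for English text."""
    return max(1, len(text) // 4)

def trim_history(messages: list[dict], budget: int) -> list[dict]:
    """Keep the system prompt plus the most recent turns that fit."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    kept, used = [], sum(estimate_tokens(m["content"]) for m in system)
    for msg in reversed(rest):  # newest turns first
        cost = estimate_tokens(msg["content"])
        if used + cost > budget:
            break
        kept.append(msg)
        used += cost
    return system + list(reversed(kept))

history = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain tokens."},
    {"role": "assistant", "content": "Tokens are chunks of text..."},
    {"role": "user", "content": "And context windows?"},
]
trimmed = trim_history(history, budget=20)
```

The key design point: the system prompt is always preserved, and older turns are dropped first when the budget is exceeded.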
Each project folder contains:
- Clear goal
- Minimal but complete implementation
- Focused scope (one core idea)
- Code you can reuse in real systems
You don’t just learn about:
- RAG → you build multiple RAG systems
- Agents → you build planners, executors, and workflows
- Vision → you run YOLO and connect it to LLMs
- Diffusion → you generate, fine-tune, and edit images
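As a taste of the RAG projects, the retrieval core is just chunking plus similarity ranking. This sketch uses a bag-of-words stand-in for a real embedding model (OpenAI, BGE, etc.); only the pipeline shape is the point.

```python
# Toy retrieval pipeline: chunk a document, "embed" chunks and query,
# rank by cosine similarity. The Counter-based embedding is a placeholder
# for a real embedding model.
import math
from collections import Counter

def chunk(text: str, size: int = 40, overlap: int = 10) -> list[str]:
    """Split text into overlapping character windows."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def embed(text: str) -> Counter:
    """Placeholder embedding: lowercase word counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

doc = "Embeddings map text to vectors. Chunking splits documents. Agents call tools."
chunks = chunk(doc)
query = embed("how does chunking split a document")
ranked = sorted(chunks, key=lambda c: cosine(query, embed(c)), reverse=True)
```

Swapping `embed` for a real model and `chunks` for a vector-DB lookup turns this into the retrieval half of a production RAG system.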
This repo is designed so that:
- Beginners can start from project 1
- Experienced engineers can jump to any phase
- No conceptual leaps are required
- Every advanced topic is grounded in earlier work
By the end, you will understand:
- How GenAI systems actually work
- How to design them cleanly
- How to debug and evaluate them
- How to ship them responsibly
This repository is tool-opinionated on purpose.
All projects are built using a production-proven GenAI stack that reflects what real teams actually use in shipped systems, not experimental or academic setups.
You will see these tools reused across projects, so they become intuitive rather than abstract.
This stack is a reference baseline, not a hard requirement. You can swap tools — but the system design principles remain the same.
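One way to keep tools swappable is to code against a small interface rather than a vendor SDK. The sketch below assumes nothing about a real provider: `EchoLLM` is a hypothetical stand-in, and a real implementation would wrap OpenAI, Claude, or a local Ollama model.

```python
# Provider-agnostic LLM interface: application code depends only on the
# Protocol, so the backend is swappable.
from typing import Protocol

class LLM(Protocol):
    def complete(self, prompt: str) -> str: ...

class EchoLLM:
    """Fake backend for local tests; replace with a real client wrapper."""
    def complete(self, prompt: str) -> str:
        return f"echo: {prompt}"

def summarize(llm: LLM, text: str) -> str:
    # Only the interface is used here, never a vendor-specific call.
    return llm.complete(f"Summarize: {text}")

result = summarize(EchoLLM(), "GenAI systems")
```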
| Category | Tool / Technology | Description (Real-World Usage) |
|---|---|---|
| Core LLMs | OpenAI (GPT) | Primary production LLM for chat, RAG, agents, reasoning, and multimodal tasks. |
| | Anthropic (Claude) | Secondary or fallback model; preferred for long context and safer outputs. |
| | Google (Gemini) | Used when tightly integrated with Google Cloud or multimodal search workflows. |
| | Meta (LLaMA) | Open-weight models for self-hosted, cost-controlled, or data-sensitive systems. |
| | Mistral AI (Mistral / Mixtral) | Lightweight open models often chosen for efficiency and EU data residency. |
| Model Runtime & Inference | Ollama | Local development runtime for testing and prompt iteration. |
| | vLLM | High-performance production inference with batching and KV caching. |
| Frameworks | LangChain | Orchestration framework for prompts, tools, memory, RAG, and agents. |
| | LlamaIndex | RAG-focused framework for indexing, chunking, and retrieval pipelines. |
| Agents | LangGraph | Production-grade agent workflows using explicit state graphs. |
| | AutoGen | Multi-agent framework mainly for experimentation and simulations. |
| Vector Databases | Pinecone | Managed vector DB for scalable semantic search and RAG. |
| | Qdrant | Self-hosted or managed alternative with strong filtering support. |
| | FAISS | In-memory vector search for local development and experiments. |
| Embeddings | OpenAI Embeddings | Default choice for most production systems. |
| | BGE | Open-source embeddings for self-hosted setups. |
| | E5 | Multilingual embeddings for global applications. |
| Search Patterns | BM25 | Keyword search for exact lexical matching. |
| | Hybrid Search | Keyword + vector search (most common RAG pattern). |
| | Re-rankers | Improve retrieval quality in high-precision systems. |
| Evaluation | RAGAS | Automated evaluation for RAG quality and faithfulness. |
| Observability | Langfuse | Tracing, prompt versioning, and debugging for LLM workflows. |
| | Helicone | Request-level logging, latency, and cost tracking. |
| Fine-Tuning | LoRA | Lightweight fine-tuning without full retraining. |
| | QLoRA | Memory-efficient fine-tuning on limited hardware. |
| | PEFT | Parameter-efficient tuning methods (LoRA/QLoRA family). |
| Multimodal | Whisper | Speech-to-text for voice bots and transcription systems. |
| | Stable Diffusion | Image generation for tools and internal workflows. |
| | DALL·E | Managed image generation for creative use cases. |
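The hybrid-search row above is the one pattern worth sketching: a keyword ranking and a vector ranking are fused, commonly with Reciprocal Rank Fusion (RRF). The two input rankings below are hard-coded stand-ins; in practice they would come from BM25 and a vector DB.

```python
# Reciprocal Rank Fusion: score each document by the sum of 1/(k + rank)
# across all input rankings, then sort by fused score.

def rrf(rankings: list[list[str]], k: int = 60) -> list[str]:
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

keyword_hits = ["doc3", "doc1", "doc7"]  # e.g. BM25 order
vector_hits = ["doc1", "doc5", "doc3"]   # e.g. embedding-similarity order
fused = rrf([keyword_hits, vector_hits])
```

Documents ranked well by both retrievers (here `doc1` and `doc3`) float to the top, which is exactly why hybrid search outperforms either retriever alone on mixed query types.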
This repository is ideal for:
- Developers learning Generative AI seriously
- Backend or full-stack engineers adding AI features
- Engineers transitioning into GenAI roles
- Builders creating real AI-powered products
If you like learning by building systems, this repo is for you.
Recommended approach:
- Start from Project 01 and move forward
- Run every project locally
- Modify and break things
- Reuse components across projects
- Treat this as a personal GenAI toolkit
You can also:
- Use it as a course
- Use it as a reference
- Use it as a portfolio foundation
MIT — free to use, modify, and build on.