A comprehensive archive of transcripts from Lenny's Podcast, organized for easy use with AI coding assistants and language models.
Lenny's Podcast features interviews with world-class product leaders and growth experts, providing concrete, actionable, and tactical advice to help you build, launch, and grow your own product.
Browse by topic: Start with index/README.md to explore episodes by topic.
Search transcripts:
grep -r "product-market fit" episodes/├── episodes/ # 269 episode transcripts
│ └── {guest-name}/
│ └── transcript.md
├── index/ # AI-generated topic index
│ ├── README.md # Main entry point
│ ├── product-management.md # Episodes about product management
│ ├── leadership.md # Episodes about leadership
│ └── ... # 50+ topic files
└── scripts/
└── build-index.sh # Script to regenerate index
Each episode has its own folder named after the guest(s), containing a transcript.md file with:
-
YAML Frontmatter - Structured metadata including:
guest: Name of the guest(s)title: Full episode titleyoutube_url: Link to the YouTube videovideo_id: YouTube video IDdescription: Episode descriptionduration_seconds: Episode length in secondsduration: Human-readable durationview_count: Number of views at time of archivalchannel: Channel name
-
Transcript Content - Full text transcript of the episode
The index/ folder contains AI-generated keyword tags for each episode, organized by topic:
| Topic | Description |
|---|---|
| Product Management | 57+ episodes on PM skills and practices |
| Leadership | Episodes on management and leadership |
| Growth Strategy | Growth tactics and frameworks |
| Product-Market Fit | Finding and measuring PMF |
See index/README.md for the complete list of 50 topics.
The index is generated using Claude CLI. To regenerate:
./scripts/build-index.shThis processes transcripts through Claude to generate keyword tags. The script is idempotent - it skips episodes already present in keyword files, so it can be run multiple times safely.
Each transcript is a standalone markdown file that can be easily parsed by AI systems. The YAML frontmatter provides structured metadata that can be extracted programmatically.
import yaml
def read_transcript(filepath):
with open(filepath, 'r') as f:
content = f.read()
# Split frontmatter and content
parts = content.split('---')
if len(parts) >= 3:
frontmatter = yaml.safe_load(parts[1])
transcript = '---'.join(parts[2:])
return frontmatter, transcript
return None, content
# Example usage
metadata, transcript = read_transcript('episodes/brian-chesky/transcript.md')
print(f"Guest: {metadata['guest']}")
print(f"Title: {metadata['title']}")This archive contains 269 transcripts from Lenny's Podcast episodes.
- Transcripts: Sourced from the Lenny's Podcast Transcripts Archive
- Metadata: Extracted from the Lenny's Podcast YouTube channel
If you notice any issues with the transcripts or metadata, please open an issue or submit a pull request.
This archive is for educational and research purposes. All content belongs to Lenny's Podcast and the respective guests. Please visit the official YouTube channel to support the creators.
The transcripts are provided for personal and educational use. Please respect the original content creators' rights.