Conversation

@edoardob90 (Member) commented May 7, 2025

The Part 1 notebook introduces the fundamentals of neural-network-based language modeling, from the traditional bigram approach to the simplest neural networks.

  • Implementation of a single-layer neural network for character-level language modeling
  • Comparison with the bigram model approach, highlighting similarities in performance but differences in flexibility
  • Step-by-step explanation of the neural network pipeline:
    • One-hot encoding of character inputs
    • Forward pass through a weight matrix
    • Softmax transformation to obtain probability distributions
    • Loss calculation using negative log-likelihood
    • Backward pass for gradient computation
    • Weight updates using gradient descent
  • Introduction to regularization
  • Demonstration of sampling from the trained model

It's the first step of a step-by-step introduction to language modeling with the PyTorch library. A minimal sketch of the pipeline described above is included below for reference.
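For reference, here is a minimal end-to-end sketch of the pipeline the bullet points describe: a character-level bigram model implemented as a single linear layer. The `data/lm/names.txt` path, the 27-symbol vocabulary, the learning rate, and the regularization strength are placeholder assumptions for illustration, not necessarily what the notebook uses.

```python
import torch
import torch.nn.functional as F

# Assumed setup: lowercase names, one per line, with '.' used as a start/end token
# (26 letters + '.' = 27 symbols). Path and hyperparameters are placeholders.
words = open('data/lm/names.txt', 'r').read().splitlines()
chars = ['.'] + sorted(set(''.join(words)))
stoi = {c: i for i, c in enumerate(chars)}
itos = {i: c for c, i in stoi.items()}
vocab_size = len(chars)  # 27 for lowercase names plus the '.' token

# Build bigram training pairs: each character is used to predict the next one
xs, ys = [], []
for w in words:
    s = ['.'] + list(w) + ['.']
    for c1, c2 in zip(s, s[1:]):
        xs.append(stoi[c1])
        ys.append(stoi[c2])
xs, ys = torch.tensor(xs), torch.tensor(ys)

g = torch.Generator().manual_seed(42)
W = torch.randn((vocab_size, vocab_size), generator=g, requires_grad=True)  # the single-layer "network"

for step in range(100):
    # Forward pass: one-hot encode the inputs and multiply by the weight matrix
    xenc = F.one_hot(xs, num_classes=vocab_size).float()
    logits = xenc @ W
    # Softmax: exponentiate and normalize each row into a probability distribution
    counts = logits.exp()
    probs = counts / counts.sum(dim=1, keepdim=True)
    # Negative log-likelihood of the correct next character, plus an L2 penalty (regularization)
    loss = -probs[torch.arange(len(ys)), ys].log().mean() + 0.01 * (W ** 2).mean()

    # Backward pass and gradient-descent weight update
    W.grad = None
    loss.backward()
    W.data += -10.0 * W.grad

# Sampling: start from the '.' token and repeatedly draw the next character
out, ix = [], 0
while True:
    p = (F.one_hot(torch.tensor([ix]), num_classes=vocab_size).float() @ W).exp()
    p = p / p.sum()
    ix = torch.multinomial(p, num_samples=1, generator=g).item()
    if ix == 0:  # back at the end token
        break
    out.append(itos[ix])
print(''.join(out))
```

The small L2 penalty on `W` is the regularization step; it is roughly analogous to the smoothing used in the counting-based bigram model.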

@edoardob90 (Member, Author) commented May 7, 2025

Left to do:

  • Add more references to extra material
  • Update Table of Contents
  • Finalize the exercises

@edoardob90 (Member, Author) commented May 10, 2025

Left to do:

  • Add solutions notebook

@edoardob90 force-pushed the new-material/pytorch-llm-tutorial branch from f36ddee to 9dad59f on May 12, 2025 at 08:28
@despadam (Contributor)

Also, this should be included in 00_index.ipynb

@despadam (Contributor) left a comment

LGTM 👏

@edoardob90 merged commit cbd5572 into main on May 12, 2025 (1 check passed)
@edoardob90 deleted the new-material/pytorch-llm-tutorial branch on May 12, 2025 at 20:23
@Snowwpanda (Collaborator) left a comment

Looks nice, good work.

"device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')\n",
"\n",
"# Load dataset\n",
"words = open('data/names.txt', 'r').read().splitlines()\n",

Suggested change
- words = open('data/names.txt', 'r').read().splitlines()
+ words = open('data/lm/names.txt', 'r').read().splitlines()

names.txt is in a subfolder.
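A small sketch of the corrected loading step, assuming the `data/lm/` location from the suggestion above (`pathlib` is just one way to make the path explicit):

```python
from pathlib import Path

# The dataset lives in a subfolder; 'data/lm/names.txt' follows the suggested change above
data_file = Path('data') / 'lm' / 'names.txt'
words = data_file.read_text().splitlines()
```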
