Neural Networks Foundation — Micrograd Implementation

Overview

This project is part of a personal learning journey to deepen foundational knowledge in neural networks before moving on to attention mechanisms and transformers. The approach taken was to study Andrej Karpathy's micrograd repository (https://github.com/karpathy/micrograd) as an educational reference, understand its internals thoroughly, then implement a personal version from scratch without looking at the source code.

What is micrograd

Micrograd is a tiny scalar-valued autograd engine that implements backpropagation over a dynamically built Directed Acyclic Graph (DAG), with a PyTorch-like neural network library on top of it. Credit goes to Andrej Karpathy for the original project: https://github.com/karpathy/micrograd.

My Implementation

The implementation process consisted of two phases:

Study Phase: Carefully studying engine.py in the original repository to understand the Value class, how operations (add, mul, pow, relu) construct the computation graph, and how the backward() method utilizes topological sort to propagate gradients.
Re-implementation Phase: Closing the reference and re-implementing engine.py from scratch from memory.

During the first attempt, a few errors were identified and corrected:

Incorrect method signatures.
A typo in the __add__ method where self was wrapped instead of other.
Naming the ReLU operation __relu__, which is not a recognized Python dunder method, and correcting it to relu.

Files

engine.py

The core engine containing the Value class, which implements scalar autograd. It supports the following operations:

Addition: __add__, __radd__
Subtraction: __sub__, __rsub__
Multiplication: __mul__, __rmul__
Division: __div__, __rdiv__
Power: __pow__
Negation: __neg__
ReLU Activation: relu
Backpropagation: backward() using a topological sort of the computation graph.

test.py

A simple test script to verify the correctness of the forward and backward passes.

from engine import Value

a = Value(2.0)
b = Value(3.0)
c = a * b + a**2  # forward result: 10.0
c.backward()

print(f"a.grad: {a.grad}")  # Expected: 7.0 (dc/da = b + 2a = 3 + 4)
print(f"b.grad: {b.grad}")  # Expected: 2.0 (dc/db = a = 2)

Learning Goal

This project is a step in a broader learning path: mastering neural network fundamentals, including Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and Spiking Neural Networks (SNNs), before advancing to the study of attention mechanisms and transformer architectures.

Credit

Credit to Andrej Karpathy for the original micrograd project: https://github.com/karpathy/micrograd

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
engine.py		engine.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Networks Foundation — Micrograd Implementation

Overview

What is micrograd

My Implementation

Files

engine.py

test.py

Learning Goal

Credit

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Neural Networks Foundation — Micrograd Implementation

Overview

What is micrograd

My Implementation

Files

engine.py

test.py

Learning Goal

Credit

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages