Projects — Jeet Ganatra

PYTHON PYTORCH GRPO 2026

post-training a 0.6B model to reason about math — GRPO vs. distillation

PYTHON PYTORCH DDP 2026

reproducing the chat model training pipeline, end to end.

PYTHON NCCL BFLOAT16 2026

124M params, custom training loop