Sitemap
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.
Read more
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Headings are cool
You can have many headings
Aren’t headings cool?
Read moreBlog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Headings are cool
You can have many headings
Aren’t headings cool?
Read moreBlog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Headings are cool
You can have many headings
Aren’t headings cool?
Read moreBlog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Headings are cool
You can have many headings
Aren’t headings cool?
Read moredatasets
QnA Irrigation Diseases Dataset
This dataset contains a comprehensive Question & Answer collection focused on water management technologies, irrigation systems, and related agricultural practices for sustainable farming. The dataset is derived from technical documentation and research publications related to water management in ag
Read more
QnA Plant Diseases Dataset
This dataset contains a comprehensive Question & Answer collection focused on plant diseases, their management, treatment protocols, and diagnostic techniques. The dataset provides detailed information about various plant pathologies, fungicide applications, and disease identification methods for ag
Read more
QnA Soil Diseases Dataset
This dataset contains a comprehensive Question & Answer collection focused on soil management, soil health, organic farming practices, and soil-related agricultural techniques. The dataset is derived from technical guides and documentation related to sustainable soil management practices, with empha
Read more
models
ViT
Published:
ViT-B/16 from scratch on a 3-class Food-101 subset. Train loss 1.20 / test loss 1.52. Read more
GPT
Published:
Decoder-only transformer trained on TinyShakespeare, replicating the original OpenAI GPT architecture from scratch. Read more
BERT
Published:
Bidirectional encoder pre-trained with masked language modelling on the Cornell Movie Dialogs corpus. Read more
CycleGANs
Published:
Cycle-consistent unpaired image translation on Cityscapes — two generators, two discriminators, cycle + identity losses. Read more
Differential Transformer
Published:
Differential attention replicated from scratch — two attention maps subtracted to cancel noise. Trained on TinyShakespeare on A100. Read more
Encoder-Decoder
Published:
LSTM-based Seq2Seq encoder-decoder for German→English translation. Train/val loss ~1.38 in 10 epochs. Read more
Fine Tuning using PEFT
Published:
QLoRA fine-tuning scripts using PEFT + BitsAndBytes for both decoder and encoder-type models. Read more
GRU
Published:
GRU from scratch. 16 hidden units, 50 epochs. Train loss 0.51 / val loss 0.48. Read more
Attention Mechanisms
Published:
From-scratch implementations of Bahdanau and Luong attention in PyTorch. Read more
RNNs
Published:
Vanilla RNN from scratch. 16 neurons, 50 epochs. Train loss 0.51 / val loss 0.50. Read more
Transformer
Published:
Encoder-decoder transformer for English→Hindi translation on Samanantar (~25M params). Published on HuggingFace. Read more
Mixtral
Published:
Sparse MoE transformer replicated from scratch on TinyShakespeare. Train loss 2.04 / val loss 2.09 in 1,000 steps on T4. Read more
DPO
Published:
Direct Preference Optimization applied to Qwen0.5B-Instruct on UltraFeedback. Train loss 0.67 in 3,000 iterations. Read more
SimplePO
Published:
Reference-free preference optimization (SimplePO) on OPT-330M. Batch size 128, lr=2e-5, beta=2 on UltraFeedback. Read more
LoRA
Published:
Low-rank adaptation implemented from scratch in PyTorch. Train/val loss ~3.5 in 1,000 steps on A100. Read more
ORPO
Published:
Odds Ratio Preference Optimization on OPT-330M. Reference-free alignment reaching train loss 1.70 in 3,000 iterations. Read more
Gemma
Published:
Google’s Gemma architecture replicated from scratch — multi-query attention and GeGLU activations on TinyShakespeare. Read more
Llama
Published:
Decoder-only Llama replicated from scratch with RoPE, SwiGLU, RMSNorm and GQA. Read more
CLiP
Published:
Contrastive vision-language model trained on Flickr8K. Train loss 1.3 / val loss 2.2 in 30 epochs on T4. Read more
DDP
Published:
Llama trained with PyTorch DistributedDataParallel (torchrun). Val loss 1.1 in 8,000 iterations on TinyShakespeare. Read more
Llava
Published:
Visual instruction tuning replicated from scratch on Flickr8K. Train loss 0.23 / val loss 0.22 in 5 epochs on T4. Read more
Seq2Seq
Published:
GRU-based Seq2Seq with both Bahdanau and Luong attention from scratch. 128 hidden units, 50 epochs. Read more
Whisper
Published:
Whisper ASR from scratch — CNN on 80-channel mel spectrograms + 6-layer transformer decoder. Trained on GigaSpeech. Read more
LSTM
Published:
LSTM from scratch (~128K params). 128 hidden units, 50 epochs. Train loss 0.49 / val loss 0.48. Read more
Gemma3
Published:
90M-parameter Gemma 3 with local sliding-window attention (128-token blocks). Val loss 1.77 in 25k steps on TinyStories. Read more
Llama4
Published:
1.2B-parameter MoE (32×12M experts, top-1 routing) trained on TinyStories. Val loss 1.70 in 20k steps on Kaggle P100. Read more
Moonshine
Published:
Compact transformer ASR (288-dim, 6 heads) trained on GigaSpeech for 1,500 steps. Notes on overfitting at ~25 hours. Read more
PaliGemma
Published:
Google’s PaliGemma VLM (SigLIP + Gemma) replicated from scratch on Flickr8K. Read more
Pix2Pix
Published:
Conditional GAN for paired image-to-image translation (aerial→map) replicated from scratch. PatchGAN discriminator. Read more
SigLip
Published:
Sigmoid-loss vision-language pretraining replicated from scratch on Flickr8K — avoids global softmax normalisation. Read more
TTS
Published:
Tacotron-style transformer TTS from scratch — 512-dim phoneme encoder, mel spectrogram decoder, 16kHz on GigaSpeech. Read more
VAE
Published:
VAE on CelebA (128×128). 4-layer conv encoder, 32D latent, ConvTranspose decoder. Reconstruction + KL loss over 200 epochs. Read more
WGANs
Published:
Wasserstein GAN and WGAN-GP implemented from scratch on MNIST — gradient penalty for stable training. Read more
Kimi-K2
Published:
DeepSeekV3-inspired MoE with latent attention trained with Muon optimizer. Pre-trained weights on HuggingFace. Read more
CGANs
Published:
Conditional GAN on MNIST — class-conditioned 64×64 digit generation. 30 epochs, BCE loss, TensorBoard logging. Read more
CLAP
Published:
Contrastive Language-Audio Pretraining from scratch on GigaSpeech. 768D text / 2048D audio → 1024D shared space. Read more
DCGANs
Published:
Deep Convolutional GAN trained on CelebA and CIFAR-10. ~7,800 steps (CelebA) and ~11,700 steps (CIFAR-10). Read more
DeepSeekV3
Published:
16×4 MoE with Multi-head Latent Attention and auxiliary-free load balancing, trained on TinyStories on Kaggle P100. Read more
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Read more
Portfolio item number 2
Short description of portfolio item number 2
Read more
projects
| Movies Review System | Spoiler-Free Sentiment-Analysis based Movies Review System) |
Published:
Introducing the Movie Review System, where AI meets movie magic to revolutionize how viewers experience films. This project goal is to provide an interface for spoiler-free reviews and sentiment analysis, enhancing the viewing journey. With advanced models like Voting Classifier and Bi-LSTMs powered by Keras and TensorFlow, we achieve impressive metrics—a 91% accuracy, 91% precision, and 90% recall that understands and enhances the users’ with the respective movies’ plot from a bird’s eye view. Read more
| MoviesMania (Geek-o-thon) | A Reverse Search based Movies Recommendation System |
Published:
Step into the future of entertainment discovery with MoviesMania. The rpoduct aims to simplify your search for the perfect movie or web series. Using various AI/ML techniques and elements, we analyze uploaded video clips to predict movie titles and recommend similar content with an impressive accuracy. Experience flavoured recommendations tailored to your tastes, powered by Keras, Flask, and advanced face recognition algorithms with a full-fledged movie recommendation system. Read more
| PlogPayouts | AI-driven Plogging System |
Published:
Transform your daily jog into a mission for a cleaner world with PlogPayouts. Our innovative website + app rewards you for collecting litter, promoting fitness and environmental cleanliness. Utilizing AI for trash categorization and optimized routes, and fostering community through shared stories, PlogPayouts turns every step into a step towards a greener, more inclusive society. Join us and make a difference today! Read more
| Insight-Ed (HackNITR 5.0) | EdTech Platform for Student and Teacher |
Published:
Imagine an online classroom where teachers instantly know when and why students lose focus. Our AI-powered solution bridges the knowledge gap by detecting student emotions and attentiveness, highlighting problem areas, makes the teacher aware of each student’s progress. With features like reverse video search, dynamic questionnaires, and advanced Q&A bots, we transform the learning experience, making it more interactive and insightful. Redefining online education with our model, ensuring every student gets the attention they need, right when they need it. Read more
| FarmGenie (GeoHack 2024) | Empowering farmers with real-time insights and expert guidance via AI-driven space |
Published:
Our platform utilizes LLMs and a Mixture of Expert (MoE) approaches to provide precise guidance on soil management, plant disease identification, and irrigation techniques. Built as a scalable web application with a Next.js frontend and backend, and supported by a Redis queue and multiple worker nodes, FarmGenie ensures robust performance. The system’s multilingual support, interactive community forum, and up-to-date knowledge base facilitate seamless, expert-driven assistance for both new and experienced farmers. Read more
| NeatRL Playground | AI Games Showcase powered by Reinforcement Learning |
Published:
Beautiful, interactive website showcasing AI-powered games with reinforcement learning agents. Features Pong AI with Deep Q-Learning, real-time WebSocket communication, and smooth animations. Deployed on Vercel (frontend) and Render (game server) with production-ready health checks and headless mode. Read more
| Paper Replications | ML/DL Research Paper Implementations |
Published:
A comprehensive collection of code implementations replicating results from influential machine learning and deep learning research papers. Features 30+ models including Transformers, GANs, Vision models, and RLHF techniques. Read more
| NeatRL | Deep Reinforcement Learning Algorithms Library |
Published:
Comprehensive implementations of deep RL algorithms including DQN, A2C, PPO, DDPG, TD3, and SAC. Features one-file implementations, experiment tracking with W&B, automatic video recording, and support for Gymnasium environments. Main NeatRL library provides high-quality training utilities with focus on simplicity and performance. Read more
| SmolCluster | Distributed Deep Learning Library for Heterogeneous Hardware |
Published:
Educational Library for training/inference of neural networks across heterogenous compute like Mac minis, Raspberry Pi, and GPUs, written using only socket library in Python. Supports FSDP, Classic Data Parallelism, Elastic DP, Synchronous Parameter Server, and Model Parallelism. Read more
publications
rl
Q-Learning
Published:
Tabular Q-Learning and Value Iteration implemented from scratch as educational notebooks. Read more
DQN Flappy
Published:
DQN agent trained on Flappy Bird using pixel observations, experience replay, and epsilon-greedy exploration. Read more
VizDoom RL
Published:
DQN agent trained on VizDoom Basic via Gymnasium wrapper, with grayscale preprocessing, replay buffer, and W&B logging. Read more
GRPO
Published:
Group Relative Policy Optimization — DeepSeek-R1’s RL training objective implemented from scratch. Read more
DQN Frozenlake
Published:
Implementation of DQN-FrozenLake reinforcement learning algorithm Read more
Flappybird PPO
Published:
Implementation of FlappyBird-PPO reinforcement learning algorithm Read more
Frozen Lake
Published:
Implementation of Frozen-Lake reinforcement learning algorithm Read more
Imitation Learning
Published:
Implementation of Imitation Learning reinforcement learning algorithm Read more
smolhub
Smol Mixtral
Published:
A PyTorch implementation of a Mixtral inspired transformer model with Mixture of Experts (MoE), designed for text generation and understanding tasks. This model is built on the Mixtral architecture wi…
Read more
Smol Transformer
Published:
A compact implementation of an Encoder-Decoder Transformer for sequence-to-sequence translation tasks. This project implements a translation model from English to Hindi using the Samanantar dataset.
Read more
Story Kimi
Published:
A PyTorch implementation of a DeepSeek V3 inspired transformer model with Mixture of Experts (MoE), Latent Attention, and other advanced features.
Read more
Story Llama
Published:
So, I trained a Llama a 88M architecture I coded from ground up to build a small instruct model, going through the below-mentioned stages from scratch.
Read more
Story Mixtral
Published:
A PyTorch implementation of a Mixtral inspired transformer model with Mixture of Experts (MoE), Flash Attention, and other advanced features.
Read more
Smol Llama
Published:
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model, going through the below-mentioned stages from scratch.
Read more
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Heading 1
Heading 2
Heading 3
Read moreTeaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.