IMPLEMENTATION AND OPTIMIZATION OF GPT 2
Summary
Self Deep Learning Project
UNDERGRADUATE STUDENT RESEARCHER
Summary
Extracted key information from document images for visual question answering. Experimented with various transformer-based architectures for scene and text understanding like ViBERT, VILBERT, Chargrid, etc. Performed multi-modal learning for VQA (visual question answering) using layout aware scene text transformers.
GRADUATE ENGINEER TRAINEE
Summary
Developing Al-driven AMRs (Autonomous Mobile Robots), L2 Autonomous vehicle for reducing human intervention in shop floors, warehouses, and manufacturing plants. Working on deep learning architectures for V-SLAM, structure from motion, scene reconstruction, and monocular depth estimation. Worked on multiple sensor fusion (camera, lidar, GPS & IMU) for mapping and localization of autonomous vehicles, based on autoware. Finetuned YOLOv8 model on real world data captured from Intet RealSense Depth Camera, for robust pallet detection. Implemented detection and pose estimation from detected bounding boxes using docker container on Nvidia Jetson Nano. Performed multi-view object detection by minimizing epipolar error using Hungarian matching.
CLASS 12
CBSE
Grade: 95.3%
CLASS 10
ICSE
Grade: 90.17%
→
B.TECH
MECHANICAL ENGINEERING
Grade: 7.44/10
Awarded By
IIT Jodhpur
Awarded By
IIT Jodhpur
Awarded By
IIT Jodhpur
C++, C, Python, Simulink, R, Shell, Matlab, Java.
PyTorch, PyTorch Lightning, TensorFlow, ONNX, TensorRT, OpenCV, ROS1/ROS2, Open3D, Hugging-face, Ollama, SQL, Docker, Git, Scikit-Learn, Rust, SHAP, Xgboost, Lime, EDA, Tableau, Matplotlib.
AI/MLOps - AWS/Azure/GCP, Model Deployment (Docker & Kubernetes), Weights & Biases (W&B), DVC (Data Version Control), CI/CD for ML (GitHub Actions, GitLab CI), Linux shell utilities, Slurm HPC, RAG, Data Science.
Summary
Self Deep Learning Project
Summary
Self Deep Learning Project
Summary
Course Project (Advanced ML) | Dr. Mayank Vatsa