
Hi, I'm Abhay Narayan Dwivedi

RL Researcher · Deep Learning · Computer Vision
B.Tech Mathematics & Computing, Central University of Karnataka (2023–2027)

About me

I work at the intersection of reinforcement learning, robotics, and applied deep learning. My research focuses on biped locomotion, obstacle avoidance, and LLM-guided reward shaping. I enjoy building systems that are both theoretically grounded and practically deployable, from simulation environments to real-time computer vision applications.

Currently working on: LLM-driven RL reward design.

Publications

Learning Multi-Skill Locomotion in Underactuated Biped: A Waypoint-Based Reward Shaping Approach
Published — IEEE ICC 2025 (International Peer-Reviewed Conference)

UAV-Assisted Navigation of an Underactuated Biped via Soft Actor-Critic Reinforcement Learning
Under review — IJCAI-ECAI 2026 (Tier-1 AI Conference)

Featured projects

LLM-guided reward shaping for bipedal locomotion

Closed-loop GPT-4 pipeline that autonomously refines RL reward functions across iterations — achieving ~69× improvement in forward locomotion distance and 67% reduction in torso tilt.
GPT-4 SAC PyBullet Gymnasium NumPy

RL biped locomotion — IIT Mandi

6–8 DOF underactuated biped trained with SAC, TD3, and DDPG over 10M+ steps. Integrated A* global path planning for hierarchical obstacle avoidance — 94% navigation success, 2% fall rate.
PyBullet SAC TD3 DDPG A* Gymnasium
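
For readers unfamiliar with the global-planning half of that hierarchy, here is a minimal A* sketch on a 4-connected occupancy grid. It is an illustration only, not the project's code: the grid layout, the `astar` function name, and the unit step cost are assumptions made for this example. In a hierarchical setup, the returned cells would typically serve as waypoints for the learned locomotion policy.

```python
import heapq

def astar(grid, start, goal):
    """Return a shortest 4-connected path from start to goal, or None if unreachable.

    grid is a list of rows; 0 marks a free cell and 1 marks an obstacle.
    """
    rows, cols = len(grid), len(grid[0])

    def heuristic(cell):
        # Manhattan distance is admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(heuristic(start), 0, start)]  # entries are (f, g, cell)
    came_from = {}
    best_cost = {start: 0}

    while open_heap:
        _, cost, cell = heapq.heappop(open_heap)
        if cell == goal:
            # Walk the parent links back to the start, then reverse.
            path = [cell]
            while cell in came_from:
                cell = came_from[cell]
                path.append(cell)
            return path[::-1]
        if cost > best_cost[cell]:
            continue  # stale heap entry; a cheaper route was already found
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best_cost.get((nr, nc), float("inf")):
                    best_cost[(nr, nc)] = new_cost
                    came_from[(nr, nc)] = cell
                    heapq.heappush(
                        open_heap,
                        (new_cost + heuristic((nr, nc)), new_cost, (nr, nc)),
                    )
    return None

# Hypothetical 4x4 map used only for this example.
grid = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
]
path = astar(grid, (0, 0), (3, 3))
blocked = astar([[0, 1], [1, 0]], (0, 0), (1, 1))
```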

Face recognition attendance system

Real-time contactless attendance with a 128-d face encoding pipeline, liveness detection for anti-spoofing (YuNet face detection plus a custom ONNX model), and a secure admin dashboard with live video streaming.
Flask OpenCV dlib face_recognition ONNX SQLite
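
The matching step in a 128-d encoding pipeline usually comes down to a nearest-neighbour search under Euclidean distance. The sketch below shows that idea with NumPy only. It is not the project's code: `match_face`, the sample names, and the random vectors are hypothetical, and the 0.6 tolerance simply mirrors the default used by the `face_recognition` library, so treating it as the project's threshold is an assumption.

```python
import numpy as np

def match_face(known_encodings, names, candidate, tolerance=0.6):
    """Return the name of the closest known encoding, or None if none is within tolerance."""
    # Euclidean distance from the candidate to every enrolled encoding.
    distances = np.linalg.norm(known_encodings - candidate, axis=1)
    best = int(np.argmin(distances))
    return names[best] if distances[best] <= tolerance else None

# Synthetic stand-ins for real encodings (assumption: real ones come from a face encoder).
rng = np.random.default_rng(0)
known = rng.normal(size=(3, 128))
names = ["alice", "bob", "carol"]

probe = known[1] + 0.01 * rng.normal(size=128)    # small perturbation of an enrolled face
stranger = known[1] + 1.0 * rng.normal(size=128)  # large perturbation, should not match

probe_match = match_face(known, names, probe)
stranger_match = match_face(known, names, stranger)
```

A lower tolerance makes matching stricter (fewer false accepts, more false rejects), which is the main knob to tune for an attendance setting.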

MNIST neural network from scratch

Fully connected network built with NumPy only — manual forward/backward propagation, softmax activation, cross-entropy loss, and mini-batch gradient descent. 90%+ test accuracy.
NumPy Matplotlib
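
As an illustration of the output layer described above, here is a minimal NumPy sketch of softmax, cross-entropy loss, and the gradient that starts backpropagation. It is not the project's code: the function names, batch size, and random logits are assumptions for this example.

```python
import numpy as np

def softmax(logits):
    # Subtract the row-wise max before exponentiating for numerical stability.
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def cross_entropy(probs, labels):
    # Mean negative log-likelihood of the true class; labels are integer class ids.
    n = labels.shape[0]
    return -np.log(probs[np.arange(n), labels] + 1e-12).mean()

def output_gradient(probs, labels):
    # Gradient of the mean cross-entropy loss with respect to the logits:
    # (probs - one_hot(labels)) / batch_size.
    n = labels.shape[0]
    grad = probs.copy()
    grad[np.arange(n), labels] -= 1.0
    return grad / n

# Hypothetical mini-batch: 4 samples, 10 classes (as in MNIST).
rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 10))
labels = np.array([3, 1, 7, 0])

probs = softmax(logits)
loss = cross_entropy(probs, labels)
grad = output_gradient(probs, labels)
```

Combining softmax with cross-entropy gives the simple `probs - one_hot` gradient, which is why the two are usually implemented together.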

Hand tracking virtual painter

Real-time gesture-controlled drawing system using hand landmark detection. Supports drawing, erasing, brush control, and color selection via gesture recognition.
OpenCV MediaPipe NumPy

Skills

Languages — Python, C++, JavaScript, SQL, MATLAB, R
ML / DL — PyTorch, TensorFlow, Keras, Scikit-learn, NumPy, Pandas
RL / Simulation — Gymnasium, PyBullet, MuJoCo, Isaac Gym
Computer Vision — OpenCV, MediaPipe, dlib, ONNX
Tools — Git, Flask, SQLite, Power BI

Connect

Open to research internships, collaborations, and interesting problems in RL, robotics, and deep learning.

📧 abhaydwivedi10122005@gmail.com
