abhaydwived/README.md

Hi, I'm Abhay Narayan Dwivedi

RL Researcher · Deep Learning · Computer Vision
B.Tech Mathematics & Computing, Central University of Karnataka (2023–2027)

IEEE ICC 2025 · IJCAI-ECAI 2026 · IIT Mandi


About me

I work at the intersection of reinforcement learning, robotics, and applied deep learning. My research focuses on biped locomotion, obstacle avoidance, and LLM-guided reward shaping. I enjoy building systems that are both theoretically grounded and practically deployable, from simulation environments to real-time computer vision applications.

Currently working on: LLM-driven RL reward design.

Publications

Learning Multi-Skill Locomotion in Underactuated Biped: A Waypoint-Based Reward Shaping Approach
Published — IEEE ICC 2025 (International Peer-Reviewed Conference)

UAV-Assisted Navigation of an Underactuated Biped via Soft Actor-Critic Reinforcement Learning
Under review — IJCAI-ECAI 2026 (Tier-1 AI Conference)


Featured projects

LLM-guided reward shaping for bipedal locomotion

Closed-loop GPT-4 pipeline that autonomously refines RL reward functions across iterations — achieving ~69× improvement in forward locomotion distance and 67% reduction in torso tilt.
GPT-4 · SAC · PyBullet · Gymnasium · NumPy
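
The closed-loop idea can be sketched as a train, measure, propose cycle. This is a minimal illustration, not the project's pipeline: `refine_reward`, `toy_evaluate`, and `make_toy_proposer` are hypothetical names, the proposer is a random stand-in for the LLM call, the evaluator is a toy function in place of SAC training in PyBullet, and selecting candidates by forward distance is an assumption.

```python
import random
from typing import Callable, Dict, List, Tuple

Weights = Dict[str, float]
Metrics = Dict[str, float]


def refine_reward(
    propose: Callable[[Weights, Metrics], Weights],
    train_and_evaluate: Callable[[Weights], Metrics],
    initial: Weights,
    iterations: int = 5,
) -> Tuple[Weights, Metrics, List[Metrics]]:
    """Train, measure, ask the proposer for new weights, keep the best so far."""
    best_w = dict(initial)
    best_m = train_and_evaluate(best_w)
    history = [best_m]
    for _ in range(iterations):
        # In a real pipeline this call would prompt an LLM with the current
        # reward weights and the measured metrics (assumption).
        candidate = propose(best_w, best_m)
        metrics = train_and_evaluate(candidate)
        history.append(metrics)
        # Assumed selection rule: keep the candidate that walks farther.
        if metrics["distance"] > best_m["distance"]:
            best_w, best_m = candidate, metrics
    return best_w, best_m, history


def toy_evaluate(w: Weights) -> Metrics:
    """Hypothetical stand-in for an RL training run; peaks at (2.0, 1.0)."""
    forward, tilt = w["forward"], w["tilt_penalty"]
    return {
        "distance": 10.0 - (forward - 2.0) ** 2 - (tilt - 1.0) ** 2,
        "torso_tilt": abs(tilt - 1.0),
    }


def make_toy_proposer(seed: int = 0) -> Callable[[Weights, Metrics], Weights]:
    """Hypothetical stand-in for the LLM: nudges each weight at random."""
    rng = random.Random(seed)

    def propose(weights: Weights, metrics: Metrics) -> Weights:
        return {k: v + rng.uniform(-0.5, 0.5) for k, v in weights.items()}

    return propose


if __name__ == "__main__":
    start = {"forward": 0.0, "tilt_penalty": 0.0}
    best_w, best_m, history = refine_reward(
        make_toy_proposer(0), toy_evaluate, start, iterations=20
    )
    print(best_w, best_m)
```

Passing the proposer and evaluator as callables keeps the loop independent of any particular LLM client or simulator.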

RL biped locomotion — IIT Mandi

6–8 DOF underactuated biped trained with SAC, TD3, and DDPG over 10M+ steps. Integrated A* global path planning for hierarchical obstacle avoidance — 94% navigation success, 2% fall rate.
PyBullet · SAC · TD3 · DDPG · A* · Gymnasium
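
For readers unfamiliar with the global planner, here is a minimal A* sketch on a 4-connected occupancy grid. The grid representation, the unit step cost, and the function name `astar` are assumptions made for illustration; the source does not describe how the planner is coupled to the learned policy, so only the planning step is shown.

```python
import heapq
from typing import Dict, List, Optional, Tuple

Cell = Tuple[int, int]


def astar(grid: List[List[int]], start: Cell, goal: Cell) -> Optional[List[Cell]]:
    """Shortest 4-connected path on a grid where 0 is free and 1 is an obstacle."""
    rows, cols = len(grid), len(grid[0])

    def heuristic(cell: Cell) -> int:
        # Manhattan distance is admissible for unit-cost 4-connected moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    open_heap = [(heuristic(start), 0, start)]  # entries are (f, g, cell)
    came_from: Dict[Cell, Cell] = {}
    g_cost: Dict[Cell, int] = {start: 0}

    while open_heap:
        _, g, current = heapq.heappop(open_heap)
        if current == goal:
            # Walk the parent links back to the start.
            path = [current]
            while current in came_from:
                current = came_from[current]
                path.append(current)
            return path[::-1]
        if g > g_cost[current]:
            continue  # stale heap entry; a cheaper route was already found
        r, c = current
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_g = g + 1
                if new_g < g_cost.get((nr, nc), float("inf")):
                    g_cost[(nr, nc)] = new_g
                    came_from[(nr, nc)] = current
                    heapq.heappush(
                        open_heap, (new_g + heuristic((nr, nc)), new_g, (nr, nc))
                    )
    return None  # goal unreachable


if __name__ == "__main__":
    demo_grid = [
        [0, 0, 0, 0],
        [1, 1, 1, 0],
        [0, 0, 0, 0],
    ]
    print(astar(demo_grid, (0, 0), (2, 0)))
```

In a hierarchical setup, cells along the returned path could serve as intermediate waypoints for a low-level locomotion policy, though that coupling is an assumption here.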

Face recognition attendance system

Real-time contactless attendance with a 128-d face encoding pipeline, liveness detection (anti-spoofing via YuNet + custom ONNX model), and a secure admin dashboard with live video streaming.
Flask · OpenCV · dlib · face_recognition · ONNX · SQLite
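
Matching a 128-d encoding against enrolled users usually comes down to a nearest-neighbour test on Euclidean distance. The sketch below uses NumPy only; `match_encoding` is a hypothetical helper, and the 0.6 tolerance mirrors the documented default of `face_recognition.compare_faces`. Whether this project uses that threshold is an assumption.

```python
from typing import List, Optional, Sequence

import numpy as np


def match_encoding(
    known_encodings: Sequence[np.ndarray],
    names: List[str],
    probe: np.ndarray,
    tolerance: float = 0.6,  # assumed threshold, see the note above
) -> Optional[str]:
    """Return the name of the closest enrolled encoding, or None if none is close."""
    known = np.asarray(known_encodings, dtype=float)
    if known.size == 0:
        return None
    # Euclidean distance from the probe to every enrolled 128-d vector.
    distances = np.linalg.norm(known - probe, axis=1)
    best = int(np.argmin(distances))
    return names[best] if distances[best] <= tolerance else None


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    alice, bob = rng.normal(size=128), rng.normal(size=128)  # synthetic encodings
    print(match_encoding([alice, bob], ["alice", "bob"], alice + 0.01))
```

A lower tolerance trades missed matches for fewer false accepts, which matters for an attendance system.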

MNIST neural network from scratch

Fully connected network built with NumPy only — manual forward/backward propagation, softmax activation, cross-entropy loss, and mini-batch gradient descent. 90%+ test accuracy.
NumPy · Matplotlib
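
The core of the manual backward pass is that softmax combined with cross-entropy gives the gradient `probs - one_hot(labels)` with respect to the logits. The sketch below shows one mini-batch gradient step for the output layer only; the function names, learning rate, and shapes are illustrative assumptions rather than the project's code, and hidden layers are omitted.

```python
import numpy as np


def softmax(logits: np.ndarray) -> np.ndarray:
    """Row-wise softmax with the max subtracted for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def cross_entropy(probs: np.ndarray, labels: np.ndarray) -> float:
    """Mean negative log-likelihood of the integer class labels."""
    n = labels.shape[0]
    return float(-np.log(probs[np.arange(n), labels] + 1e-12).mean())


def sgd_step(W: np.ndarray, b: np.ndarray, X: np.ndarray, y: np.ndarray,
             lr: float = 0.1) -> float:
    """One gradient step on a softmax layer; updates W and b in place.

    Returns the loss measured before the update.
    """
    n = X.shape[0]
    probs = softmax(X @ W + b)
    loss = cross_entropy(probs, y)
    # d(loss)/d(logits) = (probs - one_hot(y)) / n
    grad_logits = probs.copy()
    grad_logits[np.arange(n), y] -= 1.0
    grad_logits /= n
    W -= lr * (X.T @ grad_logits)
    b -= lr * grad_logits.sum(axis=0)
    return loss


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(64, 5))       # synthetic features, not MNIST
    y = (X[:, 0] > 0).astype(int)      # two synthetic classes
    W, b = np.zeros((5, 2)), np.zeros(2)
    for step in range(50):
        loss = sgd_step(W, b, X, y, lr=0.5)
    print(loss)
```

For MNIST the same step would be applied to mini-batches of flattened 784-d images with 10 output classes, with the logit gradient propagated back through any hidden layers.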

Hand tracking virtual painter

Real-time gesture-controlled drawing system using hand landmark detection. Supports drawing, erasing, brush control, and color selection via gesture recognition.
OpenCV · MediaPipe · NumPy
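
A common way to turn hand landmarks into gestures is to check which fingertips sit above their PIP joints. The sketch below assumes MediaPipe's 21-point hand layout (fingertips at indices 8, 12, 16, 20 and PIP joints at 6, 10, 14, 18) and an upright hand in image coordinates. The mapping from raised fingers to painter modes is hypothetical and may differ from the project's.

```python
from typing import Dict, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) in image coordinates, y grows downward

# Landmark indices in MediaPipe's 21-point hand model (thumb omitted here).
FINGER_TIPS = {"index": 8, "middle": 12, "ring": 16, "pinky": 20}
FINGER_PIPS = {"index": 6, "middle": 10, "ring": 14, "pinky": 18}


def fingers_up(landmarks: Sequence[Point]) -> Dict[str, bool]:
    """A finger counts as raised when its tip is above (smaller y than) its PIP joint."""
    return {
        name: landmarks[tip][1] < landmarks[FINGER_PIPS[name]][1]
        for name, tip in FINGER_TIPS.items()
    }


def gesture_mode(landmarks: Sequence[Point]) -> str:
    """Hypothetical mapping from raised fingers to a painter mode."""
    raised = [name for name, is_up in fingers_up(landmarks).items() if is_up]
    if raised == ["index"]:
        return "draw"
    if raised == ["index", "middle"]:
        return "select"
    if len(raised) == 4:
        return "erase"
    return "idle"


if __name__ == "__main__":
    hand = [(0.5, 0.5)] * 21   # synthetic landmarks, all fingers folded
    hand[8] = (0.5, 0.2)       # raise the index fingertip
    print(gesture_mode(hand))
```

With real input, the `(x, y)` pairs would come from the normalized landmark coordinates reported by the hand tracker for each frame.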


Skills

Languages — Python, C++, JavaScript, SQL, MATLAB, R
ML / DL — PyTorch, TensorFlow, Keras, Scikit-learn, NumPy, Pandas
RL / Simulation — Gymnasium, PyBullet, MuJoCo, Isaac Gym
Computer Vision — OpenCV, MediaPipe, dlib, ONNX
Tools — Git, Flask, SQLite, Power BI


Connect

Open to research internships, collaborations, and interesting problems in RL, robotics, and deep learning.

📧 abhaydwivedi10122005@gmail.com
LinkedIn · GitHub · Portfolio

Pinned

  1. Learning_Multi-Skill-Locomotion-in-Underactuated-Biped Public

    Benchmarking SAC, TD3, and DDPG on multi-skill bipedal locomotion using progressive waypoint-based reward shaping in PyBullet. Published at IEEE ICC 2025.

    Python 1

  2. Underactuated-Biped-Obstacle-Avoidance-using-Deep-Reinforcement-Learning Public

    This project presents a robust and energy-efficient obstacle avoidance framework for an 8-DOF bipedal robot using Deep Reinforcement Learning (Soft Actor-Critic). By tightly integrating an A* plann…

    Python 1

  3. Face-Recognition-Attendance-system Public

    A real-time face recognition-based attendance system built with Flask, OpenCV, and face_recognition. This project enables automatic attendance marking, user management, live monitoring, and reporti…

    Python

  4. LLM-Guided-Reinforcement-Learning-for-BipedalWalker-v3 Public

    Automated LLM-Guided Reinforcement Learning Testbed. This project leverages the modern BipedalWalker-v3 environment from Gymnasium to orchestrate a continuous cycle of agent training and intellige…

    Python