Project Showcase
GitHub

LaneIQ

A reinforcement learning project for lane-level driving decisions, short-horizon world modeling, and V2X-informed traffic awareness.

System Summary

Project Overview

LaneIQ is a small driving research project built around one core task: learning when to stay in lane, stop, or merge in traffic. The agent trains in a three-lane simulator with moving cars and receives local traffic features plus V2X-style lane risk scores.

That interaction data is then reused to train a one-step world model, which predicts what happens next for a given state and action.

High-level LaneIQ system diagram showing the RL loop, the training dataset, the world model, V2X input, and planning.
High-level pipeline: RL training generates transitions, the world model learns from that dataset, and planning combines model predictions with V2X context.
Reinforcement Learning

Policy and Environment Design

The reinforcement learning component is framed as lane-level tactical decision-making in a three-lane traffic simulator. The agent receives a compact local state, chooses from discrete lane-plus-motion actions, and is optimized to balance progress, safety, and merge discipline.

Observation Space

Action Space

Reward Design

Learning Objective

Environment Generation

Predictive Modeling

World Model and Training Data

The predictive model is trained on transition tuples collected from policy-environment interaction. Its role is to approximate one-step dynamics, reward, and episode completion so that candidate action sequences can be evaluated before execution.

Training Data Schema

Sample Transition

Optimization Targets

Saved Model Evidence

Showcase

The sections below are generated from saved artifacts rather than live execution. They show one fixed-sample policy rollout and one imagination example in which the world model predicts short-horizon futures and those predictions are compared against the real simulator.

Fixed-Sample Policy Rollout

Full Rollout Log

Imagination Rollouts

Candidate Futures Ranked by Predicted Return

Predicted vs Real Environment