Designer

Developer

Yadin.

Urban Air Quality
Project 02

URBAN AIR
QUALITY

Scroll
Domain

Data Science / ML

Institution

NYIT

Year

2024

Stack

Python / SUMO / XGBoost

AI-powered solution integrating Machine Learning and SUMO to predict and reduce urban air pollution in real-time.

Engineering Strategy

The architecture leverages SUMO (Simulation of Urban Mobility) to generate high-fidelity traffic emission datasets. This data is then processed through a multi-stage ML pipeline to identify spatial-temporal pollution patterns with 85% predictive accuracy.

Optimization Logic

By applying gradient-boosted trees (XGBoost) and deep neural networks, we developed an optimization algorithm that dynamically redirects traffic flow. Simulations showed a theoretical 75% reduction in CO2 and NOx concentrations at critical intersections.

Engineering Arsenal

Tech Stack

Python
Python
Pandas
Pandas
NumPy
NumPy
Scikit-Learn
Scikit-Learn
XGBoost
XGBoost
TensorFlow
TensorFlow
SUMO
SUMO
DS Methodology

Data Pipeline &
Model Training

01

Simulation & Generation

Utilizing SUMO to simulate 24-hour traffic cycles in Flushing, NY, generating granular emission data for CO, CO2, and NOx.

02

Feature Engineering

Preprocessing raw XML outputs into structured dataframes. Engineering temporal features (hour, day) and spatial clusters for hotspot analysis.

03

Hyperparameter Tuning

Implementing GridSearch and RandomSearch cross-validation to optimize model parameters, specifically focal loss for imbalanced pollution spikes.

Data Visualization
Validation Metrics
0.85
R² Score
12%
MAPE error
Environment Simulation

Dynamic Traffic & Emission Data

Model Accuracy85%
Pollution Mitigated75%
LatencyReal-time
Next Project

Cafe Mandalay