Video Input
Raw video ingestion.
Multi-model AI pipeline analyzing video frames across 22 safety categories with explainability.
~0.5–1s per frame processing with multi-user support.
Diagram focused on system boundaries, data flow, and integration responsibilities.
Raw video ingestion.
OpenCV
Frame sampling & preprocessing.
PyTorch, Transformers
Parallel model execution.
22 safety categories classification.
Combines multi-model outputs into structured result.
MongoDB
Stores results + metadata.
Streamlit
Visualization + inspection.