The slide outlines a five-phase workflow for monitoring user focus: Calibration (MediaPipe baseline), Tracking (real-time monitoring with OpenCV), Detection (distractions identified with MediaPipe and YOLOv8), Correlation (fusion engine), and Feedback (focus score and alerts from Gemini AI). A table gives a description and the key components for each phase.
Architecture & Process Flow
{ "headers": [ "Phase", "Description", "Key Components" ], "rows": [ [ "Calibration", "Establish a baseline for user behavior (face/gaze/posture) and the environment.", "MediaPipe, initial webcam/mic setup" ], [ "Tracking", "Real-time monitoring of the user and surroundings.", "OpenCV, webcam, microphone, sounddevice" ], [ "Detection", "Identify distractions: gaze deviation, posture changes, objects, sounds.", "MediaPipe, YOLOv8" ], [ "Correlation", "Fuse the two pipelines to correlate user behavior with environmental factors.", "Fusion engine" ], [ "Feedback", "Generate a focus score (0-100%) and instant alerts.", "Gemini AI" ] ] }
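The Calibration and Detection rows can be illustrated with a minimal sketch. It assumes the baseline is a mean and standard deviation over a single numeric signal (here a hypothetical head-yaw angle in degrees) and that a deviation is flagged with a z-score threshold. The slide does not specify either choice; the function names `calibrate` and `is_deviation`, the sample values, and the threshold of 3.0 are all illustrative.

```python
from statistics import mean, stdev

def calibrate(samples):
    """Return a (mean, std) baseline from calibration samples (assumed approach)."""
    return mean(samples), stdev(samples)

def is_deviation(value, baseline, z_threshold=3.0):
    """Flag a reading whose z-score against the baseline exceeds the threshold."""
    mu, sigma = baseline
    if sigma == 0:
        # Degenerate baseline: any change from the constant value counts.
        return value != mu
    return abs(value - mu) / sigma > z_threshold

# Hypothetical head-yaw readings (degrees) collected during Calibration.
calibration_yaw = [0.0, 1.0, -1.0, 0.5, -0.5]
baseline = calibrate(calibration_yaw)

looking_away = is_deviation(25.0, baseline)   # large turn of the head
small_motion = is_deviation(0.8, baseline)    # ordinary movement
```

In a real system the readings would come from the tracking phase (for example, landmarks produced by MediaPipe), and each monitored signal would likely need its own baseline.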
Source: Dual AI pipeline: user behavior (face/gaze/posture) + environment (objects/sounds), combined by a fusion engine. Flow: Calibration → Tracking → Detection → Correlation → Feedback.
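As a rough sketch of the Correlation and Feedback steps, the code below pairs each behavior event with environment events that occurred shortly before it, then derives a 0-100 focus score. The `Event` type, the 2-second window, and the scoring formula (share of session time not spent distracted) are assumptions made for illustration. The slide attributes scoring to Gemini AI and does not describe how the fusion engine works, so this is a stand-in, not the actual method.

```python
from dataclasses import dataclass

@dataclass
class Event:
    t: float     # seconds since session start
    kind: str    # e.g. "gaze_away", "phone_detected" (hypothetical labels)

def correlate(behavior, environment, window=2.0):
    """Pair each behavior event with environment events from the preceding `window` seconds."""
    pairs = []
    for b in behavior:
        causes = [e for e in environment if 0.0 <= b.t - e.t <= window]
        pairs.append((b, causes))
    return pairs

def focus_score(distracted_seconds, session_seconds):
    """Return the percentage of the session not spent distracted (assumed formula)."""
    if session_seconds <= 0:
        raise ValueError("session_seconds must be positive")
    ratio = min(max(distracted_seconds / session_seconds, 0.0), 1.0)
    return round(100.0 * (1.0 - ratio), 1)

behavior = [Event(10.0, "gaze_away"), Event(40.0, "posture_change")]
environment = [Event(9.2, "phone_detected"), Event(30.0, "loud_noise")]

pairs = correlate(behavior, environment)
score = focus_score(distracted_seconds=15, session_seconds=60)
```

Here the gaze deviation at 10.0 s is linked to the phone detected at 9.2 s, while the posture change at 40.0 s has no environmental event within the window. A time window is only one possible correlation rule; a learned model or a weighted score could take its place.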