An advanced AI-powered video surveillance and analysis system that provides automated monitoring, visual Q&A, and real-time video captioning capabilities for CCTV cameras and video feeds.
- Support for multiple video input sources:
- Local video files
- Webcam feeds
- RTSP streams from IP cameras
- Real-time video captioning using Salesforce BLIP model
- Natural language visual Q&A using VILT model
- Interactive Streamlit interface
- Continuous monitoring and analysis