lpcinelli

Lucas Cinelli lpcinelli

17 followers · 10 following

@octos-ai
Rio de Janeiro, Brazil

Stars

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 20,828 2,826 Updated Oct 21, 2025

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

Python 80 6 Updated Jul 1, 2025

Ravi-Teja-konda / Surveillance_Video_Summarizer

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for …

Python 125 16 Updated Jun 6, 2025

AlexanderMelde / SPHAR-Dataset

Surveillance Perspective Human Action Recognition Dataset: 7759 Videos from 14 Action Classes, aggregated from multiple sources, all cropped spatio-temporally and filmed from a surveillance-camera …

Python 110 22 Updated Apr 2, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,287 3,867 Updated Oct 22, 2025

merveenoyan / smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,640 130 Updated Sep 12, 2025

OpenGVLab / vinci

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Python 75 2 Updated Jan 13, 2025

zai-org / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,419 161 Updated Mar 3, 2025

showlab / videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Python 563 57 Updated Sep 2, 2025

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

16,519 1,067 Updated Oct 20, 2025

NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,612 299 Updated Oct 20, 2025

vast-data / mattsvlm

Processing Video POC with Multimodal LLMs

Python 17 4 Updated May 12, 2025

byjlw / video-analyzer

Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition

Python 1,084 149 Updated Apr 23, 2025

insight-platform / Savant

Python Computer Vision & Video Analytics Framework With Batteries Included

Python 709 65 Updated Oct 20, 2025

layumi / Person_reID_baseline_pytorch

⛹️ PyTorch ReID: A tiny, friendly, strong PyTorch implementation of a person re-id / vehicle re-id baseline. Tutorial 👉 https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Python 4,347 1,023 Updated May 7, 2025

fixie-ai / ultravox

A fast multimodal LLM for real-time voice

Python 4,229 337 Updated Sep 2, 2025

bluenviron / mediamtx

Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS / MPEG-TS / RTP media server and media proxy that can read, publish, proxy, record, and play back video and audio streams.

Go 16,738 1,995 Updated Oct 22, 2025

rafaelpadilla / telepic

A lightweight web application for remotely viewing images from a remote computer through a web browser. 🖼️

Python 7 Updated Apr 8, 2025

Ed1sonChen / Clip2Safety

Implementation of paper "Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces"

Jupyter Notebook 10 Updated Jan 17, 2025

awslabs / amazon-kinesis-video-streams-webrtc-sdk-c

Amazon Kinesis Video Streams Webrtc SDK is for developers to install and customize realtime communication between devices and enable secure streaming of video, audio to Kinesis Video Streams.

C 1,140 364 Updated Oct 9, 2025

NVlabs / LITA

Python 186 13 Updated Oct 14, 2024

superglue-ai / superglue

superglue (YC W25) builds integrations and tools from natural language. Get production-grade tools for long tail and enterprise systems.

TypeScript 1,914 105 Updated Oct 22, 2025

jinnh / GSAD

[NeurIPS 2023] Global Structure-Aware Diffusion Process for Low-Light Image Enhancement

Python 157 10 Updated Aug 6, 2024

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,254 55 Updated Jul 23, 2025

agentjido / jido

🤖 Autonomous agent framework for Elixir. Built for distributed, autonomous behavior and dynamic workflows.

Elixir 668 39 Updated Oct 20, 2025

aws-samples / amazon-sagemaker-multiple-object-tracking

Python 15 7 Updated Oct 9, 2023

comet-ml / opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 14,975 1,116 Updated Oct 22, 2025

Yuchen413 / AnomalyRuler

Implementation for paper "Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Model"

Python 94 14 Updated Dec 16, 2024

rafaelpadilla / image_dedupe

Python 8 Updated Nov 15, 2024

vanna-ai / vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 20,893 1,942 Updated Oct 22, 2025
