Stars
ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer with native C#, Java, and Python implementations. Generates all three embedding types: dense, sparse, and ColBERT vectors.
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
Inference Stable Diffusion with C# and ONNX Runtime
[Unofficial] Simple .NET wrapper of HuggingFace Tokenizers library
Typescript and .NET implementation of BPE tokenizer for OpenAI LLMs.
.NET/C# binding for Baidu paddle inference library and PaddleOCR
基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
A framework for building messaging apps with .NET and C#.
Help developers to easily get started with GitHub Action workflows to deploy to Azure
Deployment of multiple linux containers in Azure as Azure Container Instance using Docker Compose
Azure DevOps Pipelines Self Hosted Agent in Azure Container Instance with basic private VNET connectivity
OpenAPI (f.k.a Swagger) Specification code generator. Supports C#, PowerShell, Go, Java, Node.js, TypeScript, Python
Ultra lightweight API server to convert files (.pdf, .docx, .xlsx) into formatted markdown.
A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with optional GPT-4o enhancement.
opengovsg / pdf2md
Forked from jzillmann/pdf-to-markdownA PDF to Markdown converter
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Convert PDF to HTML without losing text or format.
GitHub Actions Sample to deploy to Azure Functions on ACA hosting
Azure Functions on Container Apps. Bicep templates and Azure Pipelines included.
Docs , samples and issues for Azure Functions on Azure Container Apps
Hybrid AI orchestration stack combining local LLMs (Ollama), vector search (Qdrant), and Azure AI Foundry for scalable RAG, Agentic AI, and Vision. Built with .NET 8 and Python.
The GPT-RAG Data Ingestion service automates processing of diverse documents—PDFs, images, spreadsheets, transcripts, and SharePoint—readying them for Azure AI Search. It applies smart chunking, ge…
A fork of FluentAssertions controlled by the community.