See what the GitHub community is most excited about this month.
Apache Spark - A unified analytics engine for large-scale data processing
Chisel: A Modern Hardware Design Language
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
♞ lichess.org: the forever free, adless and open source chess server ♞
The Scala 3 compiler, also known as Dotty.
Open-source high-performance RISC-V processor
Protocol buffer compiler for Scala.
An Agile RISC-V SoC Design Framework with in-order cores, out-of-order cores, accelerators, and more
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
A next-generation Scala framework for building scalable, correct, and efficient HTTP clients and servers
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Spark: The Definitive Guide's Code Repository
SonicBOOM: The Berkeley Out-of-Order Machine
sbt, the interactive build tool
RISC-V Torture Test
Berkeley's Spatial Array Generator
Rocket Chip Generator
A fault-tolerant, protocol-agnostic RPC system
A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.