apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
joernio/joern
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
rtyley/bfg-repo-cleaner
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
lichess-org/lila
♞ lichess.org: the forever free, adless and open source chess server ♞
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
ucb-bar/chiseltest
The batteries-included testing and formal verification library for Chisel-based RTL designs.
apache/kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
akka/akka
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
gitbucket/gitbucket
A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility