apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
playframework/playframework
The Community Maintained High Velocity Web Framework For Java and Scala.
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
TheHive-Project/TheHive
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
lichess-org/lila
♞ lichess.org: the forever free, adless and open source chess server ♞
scala/scala
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
neo4j/neo4j-spark-connector
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
rtyley/bfg-repo-cleaner
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala