apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
zio/zio
ZIO — A type-safe, composable library for async and concurrent programming in Scala
joernio/joern
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
rtyley/bfg-repo-cleaner
Removes large or troublesome blobs like git-filter-branch does, but faster. And written in Scala
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
TheHive-Project/TheHive
TheHive: a Scalable, Open Source and Free Security Incident Response Platform
zio/zio-http
A next-generation Scala framework for building scalable, correct, and efficient HTTP clients and servers
apache/incubator-livy
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
com-lihaoyi/mill
Mill is a fast JVM build tool that supports Java, Scala, Kotlin and many other languages. 2-4x faster than Gradle and 4-10x faster than Maven for common workflows, Mill aims to make your project’s build process performant, maintainable, and flexible
gitbucket/gitbucket
A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility