Big Data Processing for the AI Era

Sail: An open-source computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads.

Documentation

Announcing Sail 0.2.1: Enhanced UDF support and better Spark compatibility. Read more

The LakeSail Slack Community is now live. Join now

Apache Spark → Sail

4x
Processing Speed
94% ↓
Hardware Costs
0
Code Changes

Powered by Rust, Sail offers stellar performance in benchmark evaluations.

Read more

The Sail architecture is designed to shape the future of distributed processing.

Read more

Get Started with Sail

Sail features a drop-in replacement for Spark SQL and the Spark DataFrame API in both single-host and distributed settings.

bash
pip install "pysail[spark]"
bash
sail spark server --port 50051
python
from pysail.spark import SparkConnectServer

server = SparkConnectServer(port=50051)
server.start(background=False)
bash
kubectl apply -f sail.yaml
kubectl -n sail port-forward service/sail-spark-server 50051:50051

Once you have a running Sail server, you can connect to it in PySpark.
No changes are needed in your PySpark code!

python
from pyspark.sql import SparkSession

spark = SparkSession.builder.remote("sc://localhost:50051").getOrCreate()
spark.sql("SELECT 1 + 1").show()

Sail Support Options

LakeSail offers commercial support for Sail, with flexible coverage tailored to your needs. Get in touch for more details.

Community

Free

  • Public issue tracker
  • Public Slack channel
  • No guaranteed response time
  • SQL and DataFrame API support only

Enterprise

Negotiable

  • Private issue tracker
  • Private Slack channel
  • Guaranteed response time
  • Custom integrations
  • Flexible scope and support
LakeSail, Inc. © 2025