Ratings and Reviews 25 Ratings
Ratings and Reviews 0 Ratings
What is StarTree?
StarTree Cloud functions as a fully-managed platform for real-time analytics, optimized for online analytical processing (OLAP) with exceptional speed and scalability tailored for user-facing applications. Leveraging the capabilities of Apache Pinot, it offers enterprise-level reliability along with advanced features such as tiered storage, scalable upserts, and a variety of additional indexes and connectors. The platform seamlessly integrates with transactional databases and event streaming technologies, enabling the ingestion of millions of events per second while indexing them for rapid query performance. Available on popular public clouds or for private SaaS deployment, StarTree Cloud caters to diverse organizational needs. Included within StarTree Cloud is the StarTree Data Manager, which facilitates the ingestion of data from both real-time sources—such as Amazon Kinesis, Apache Kafka, Apache Pulsar, or Redpanda—and batch data sources like Snowflake, Delta Lake, Google BigQuery, or object storage solutions like Amazon S3, Apache Flink, Apache Hadoop, and Apache Spark. Moreover, the system is enhanced by StarTree ThirdEye, an anomaly detection feature that monitors vital business metrics, sends alerts, and supports real-time root-cause analysis, ensuring that organizations can respond swiftly to any emerging issues. This comprehensive suite of tools not only streamlines data management but also empowers organizations to maintain optimal performance and make informed decisions based on their analytics.
What is Apache DataFusion?
Apache DataFusion is a highly adaptable and capable query engine developed in Rust, which utilizes Apache Arrow for efficient in-memory data handling. It is intended for developers who are working on data-centric systems, including databases, data frames, machine learning applications, and real-time data streaming solutions. Featuring both SQL and DataFrame APIs, DataFusion offers a vectorized, multi-threaded execution engine that efficiently manages data streams while accommodating a variety of partitioned data sources. It supports numerous native file formats, including CSV, Parquet, JSON, and Avro, and integrates seamlessly with popular object storage services such as AWS S3, Azure Blob Storage, and Google Cloud Storage. The architecture is equipped with a sophisticated query planner and an advanced optimizer, which includes features like expression coercion, simplification, and distribution-aware optimizations, as well as automatic join reordering for enhanced performance. Additionally, DataFusion provides significant customization options, allowing developers to implement user-defined scalar, aggregate, and window functions, as well as integrate custom data sources and query languages, thereby enhancing its utility for a wide range of data processing scenarios. This flexibility ensures that developers can effectively adjust the engine to meet their specific requirements and optimize their data workflows.
Integrations Supported
Amazon S3
Amazon Kinesis
Amazon Redshift
Apache Avro
Apache Kafka
Apache Parquet
Apache Pulsar
Confluent
Databricks Data Intelligence Platform
Embeddable
Integrations Supported
Amazon S3
Amazon Kinesis
Amazon Redshift
Apache Avro
Apache Kafka
Apache Parquet
Apache Pulsar
Confluent
Databricks Data Intelligence Platform
Embeddable
API Availability
Has API
API Availability
Has API
Pricing Information
Pricing not provided.
Free Trial Offered?
Free Version
Pricing Information
Free
Free Trial Offered?
Free Version
Supported Platforms
SaaS
Android
iPhone
iPad
Windows
Mac
On-Prem
Chromebook
Linux
Supported Platforms
SaaS
Android
iPhone
iPad
Windows
Mac
On-Prem
Chromebook
Linux
Customer Service / Support
Standard Support
24 Hour Support
Web-Based Support
Customer Service / Support
Standard Support
24 Hour Support
Web-Based Support
Training Options
Documentation Hub
Webinars
Online Training
On-Site Training
Training Options
Documentation Hub
Webinars
Online Training
On-Site Training
Company Facts
Organization Name
StarTree
Date Founded
2019
Company Location
United States
Company Website
www.startree.ai/product
Company Facts
Organization Name
Apache Software Foundation
Date Founded
2019
Company Location
United States
Company Website
datafusion.apache.org
Categories and Features
Big Data
Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates
Data Analysis
Data Discovery
Data Visualization
High Volume Processing
Predictive Analytics
Regression Analysis
Sentiment Analysis
Statistical Modeling
Text Analytics
Data Management
Customer Data
Data Analysis
Data Capture
Data Integration
Data Migration
Data Quality Control
Data Security
Information Governance
Master Data Management
Match & Merge
Database
Backup and Recovery
Creation / Development
Data Migration
Data Replication
Data Search
Data Security
Database Conversion
Mobile Access
Monitoring
NOSQL
Performance Analysis
Queries
Relational Interface
Virtualization
Streaming Analytics
Data Enrichment
Data Wrangling / Data Prep
Multiple Data Source Support
Process Automation
Real-time Analysis / Reporting
Visualization Dashboards
Categories and Features
Database
Backup and Recovery
Creation / Development
Data Migration
Data Replication
Data Search
Data Security
Database Conversion
Mobile Access
Monitoring
NOSQL
Performance Analysis
Queries
Relational Interface
Virtualization