Ratings and Reviews 0 Ratings

Total
ease
features
design
support

This software has no reviews. Be the first to write a review.

Write a Review

Ratings and Reviews 0 Ratings

Total
ease
features
design
support

This software has no reviews. Be the first to write a review.

Write a Review

Alternatives to Consider

  • Google Cloud Platform Reviews & Ratings
    60,933 Ratings
    Company Website
  • Greatmail Reviews & Ratings
    9 Ratings
    Company Website
  • Dragonfly Reviews & Ratings
    16 Ratings
    Company Website
  • ScalaHosting Reviews & Ratings
    2,358 Ratings
    Company Website
  • Semarchy xDM Reviews & Ratings
    64 Ratings
    Company Website
  • QuantaStor Reviews & Ratings
    6 Ratings
    Company Website
  • Teradata VantageCloud Reviews & Ratings
    1,120 Ratings
    Company Website
  • Safetica Reviews & Ratings
    415 Ratings
    Company Website
  • DataHub Reviews & Ratings
    10 Ratings
    Company Website
  • DbVisualizer Reviews & Ratings
    572 Ratings
    Company Website

What is IBM Analytics Engine?

IBM Analytics Engine presents an innovative structure for Hadoop clusters by distinctively separating the compute and storage functionalities. Instead of depending on a static cluster where nodes perform both roles, this engine allows users to tap into an object storage layer, like IBM Cloud Object Storage, while also enabling the on-demand creation of computing clusters. This separation significantly improves the flexibility, scalability, and maintenance of platforms designed for big data analytics. Built upon a framework that adheres to ODPi standards and featuring advanced data science tools, it effortlessly integrates with the broader Apache Hadoop and Apache Spark ecosystems. Users can customize clusters to meet their specific application requirements, choosing the appropriate software package, its version, and the size of the cluster. They also have the flexibility to use the clusters for the duration necessary and can shut them down right after completing their tasks. Furthermore, users can enhance these clusters with third-party analytics libraries and packages, and utilize IBM Cloud services, including machine learning capabilities, to optimize their workload deployment. This method not only fosters a more agile approach to data processing but also ensures that resources are allocated efficiently, allowing for rapid adjustments in response to changing analytical needs.

What is Deequ?

Deequ is a groundbreaking library designed to enhance Apache Spark by enabling "unit tests for data," which helps evaluate the quality of large datasets. User feedback and contributions are highly encouraged as we strive to improve the library. The operation of Deequ requires Java 8, and it is crucial to recognize that version 2.x of Deequ is only compatible with Spark 3.1, creating a dependency between the two. Users of older Spark versions should opt for Deequ 1.x, which is available in the legacy-spark-3.0 branch. Moreover, we also provide legacy releases that support Apache Spark versions from 2.2.x to 3.0.x. The Spark versions 2.2.x and 2.3.x utilize Scala 2.11, while the 2.4.x, 3.0.x, and 3.1.x releases rely on Scala 2.12. Deequ's main objective is to conduct "unit-testing" on data to pinpoint possible issues at an early stage, thereby ensuring that mistakes are rectified before the data is utilized by consuming systems or machine learning algorithms. In the upcoming sections, we will illustrate a straightforward example that showcases the essential features of our library, emphasizing its user-friendly nature and its role in preserving data quality. This example will also reveal how Deequ can simplify the process of maintaining high standards in data management.

Media

Media

Integrations Supported

Apache Spark
Acquia CDP
Galileo
Hadoop
IBM Cloud Object Storage
MINT
RadiantOne
Switch Automation
ZARUS

Integrations Supported

Apache Spark
Acquia CDP
Galileo
Hadoop
IBM Cloud Object Storage
MINT
RadiantOne
Switch Automation
ZARUS

API Availability

Has API

API Availability

Has API

Pricing Information

$0.014 per hour
Free Trial Offered?
Free Version

Pricing Information

Pricing not provided.
Free Trial Offered?
Free Version

Supported Platforms

SaaS
Android
iPhone
iPad
Windows
Mac
On-Prem
Chromebook
Linux

Supported Platforms

SaaS
Android
iPhone
iPad
Windows
Mac
On-Prem
Chromebook
Linux

Customer Service / Support

Standard Support
24 Hour Support
Web-Based Support

Customer Service / Support

Standard Support
24 Hour Support
Web-Based Support

Training Options

Documentation Hub
Webinars
Online Training
On-Site Training

Training Options

Documentation Hub
Webinars
Online Training
On-Site Training

Company Facts

Organization Name

IBM

Date Founded

1911

Company Location

United States

Company Website

www.ibm.com/cloud/analytics-engine

Company Facts

Organization Name

Deequ

Company Website

github.com/awslabs/deequ

Categories and Features

Data Discovery

Contextual Search
Data Classification
Data Matching
False Positives Reduction
Self Service Data Preparation
Sensitive Data Identification
Visual Analytics

Data Visualization

Analytics
Content Management
Dashboard Creation
Filtered Views
OLAP
Relational Display
Simulation Models
Visual Discovery

Categories and Features

Popular Alternatives

E-MapReduce Reviews & Ratings

E-MapReduce

Alibaba

Popular Alternatives

Spark Streaming Reviews & Ratings

Spark Streaming

Apache Software Foundation
Hadoop Reviews & Ratings

Hadoop

Apache Software Foundation
Apache Spark Reviews & Ratings

Apache Spark

Apache Software Foundation
Apache Sentry Reviews & Ratings

Apache Sentry

Apache Software Foundation
MLlib Reviews & Ratings

MLlib

Apache Software Foundation
Apache Mahout Reviews & Ratings

Apache Mahout

Apache Software Foundation