Alternatives to Alibaba Cloud DataHub
Compare Alibaba Cloud DataHub alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to Alibaba Cloud DataHub in 2026. Compare features, ratings, user reviews, pricing, and more from Alibaba Cloud DataHub competitors and alternatives in order to make an informed decision for your business.
-
1
DataHub
DataHub
DataHub Cloud is an event-driven AI & Data Context Platform that uses active metadata for real-time visibility across your entire data ecosystem. Unlike traditional data catalogs that provide outdated snapshots, DataHub Cloud instantly propagates changes, automatically enforces policies, and connects every data source across platforms with 100+ pre-built connectors. Built on an open source foundation with a thriving community of 13,000+ members, DataHub gives you unmatched flexibility to customize and extend without vendor lock-in. DataHub Cloud is a modern metadata platform with REST and GraphQL APIs that optimize performance for complex queries, essential for AI-ready data management and ML lifecycle support. -
2
DataHub
DataHub
We help organizations of all sizes to design, develop and scale solutions to manage their data and unleash its potential. At Datahub, we have over thousands of datasets for free and a Premium Data Service for additional or customised data with guaranteed updates. Datahub provides important, commonly-used data as high quality, easy-to-use and open data packages. Securely share and elegantly put data online with quality checks, versioning, data APIs, notifications & integrations. Power and simplicity, data is the fastest way for individuals, teams and organizations to publish, deploy and share structured data. Automate your data processes with our open source framework. Store, share and showcase your data with the world or just privately. Completely open source with professional maintenance and support. End-to-end solution with all parts are fully integrated. Not just tools but a standardized approach and pattern for working with your data. -
3
Striim
Striim
Data integration for your hybrid cloud. Modern, reliable data integration across your private and public cloud. All in real-time with change data capture and data streams. Built by the executive & technical team from GoldenGate Software, Striim brings decades of experience in mission-critical enterprise workloads. Striim scales out as a distributed platform in your environment or in the cloud. Scalability is fully configurable by your team. Striim is fully secure with HIPAA and GDPR compliance. Built ground up for modern enterprise workloads in the cloud or on-premise. Drag and drop to create data flows between your sources and targets. Process, enrich, and analyze your streaming data with real-time SQL queries. -
4
DataHUB+
VROC
DataHUB+ is a next generation data historian, for real-time monitoring of assets and systems across an entire network. Built in analytics and data visualization tools allow for rapid insights so you have a clear overview of what is happening in your plant, facility or city at all times. Equipment agnostic ensures that data can be integrated from any IoT device, sensor or piece of equipment. Data is stored securely and reliably, eliminating duplicated data throughout the organization, with DataHUB+ becoming the source of truth. DataHub+ doesn’t rely on expensive IT infrastructure like traditional process data historians. DataHUB+ automatically checks the data quality as it is ingested, alerting teams if there are problems with the data quality. The automatic pre-processing of data, means it is ready for data analytics and AI, eliminating data wrangling. Teams can produce reports, get alerts and easily track KPIs using DataHUB+. Let your data power your future. -
5
ETL DataHub
ETL
DataHub from ETL Solutions is an enterprise-grade data integration, orchestration, and management platform designed to help organizations connect, harmonize, and operationalize data from diverse sources into a unified, governed, and accessible ecosystem. It enables seamless ingestion and transformation of structured and unstructured data through pre-built connectors and mappings, automated workflows, change data capture, and real-time data pipelines that support analytics, reporting, and AI/ML use cases. Built for hybrid and multi-cloud environments, DataHub centralizes metadata and business logic while enforcing data governance, lineage, and quality controls so stakeholders can trust and act on enterprise data. Its orchestration engine handles complex dependencies and schedules, ensuring data arrives on time and maintains consistency across systems. -
6
Alibaba Cloud Data Integration
Alibaba
Alibaba Cloud Data Integration is a comprehensive data synchronization platform that facilitates both real-time and offline data exchange across various data sources, networks, and locations. It supports data synchronization between more than 400 pairs of disparate data sources, including RDS databases, semi-structured storage, non-structured storage (such as audio, video, and images), NoSQL databases, and big data storage. The platform also enables real-time data reading and writing between data sources such as Oracle, MySQL, and DataHub. Data Integration allows users to schedule offline tasks by setting specific trigger times, including year, month, day, hour, and minute, simplifying the configuration of periodic incremental data extraction. It integrates seamlessly with DataWorks data modeling, providing an operations and maintenance integrated workflow. The platform leverages the computing capability of Hadoop clusters to synchronize HDFS data to MaxCompute. -
7
NeoXam DataHub
NeoXam
The Single Point of Truth for data used or produced by financial institutions. NeoXam DataHub provides a set of functional modules which answer to the specific requirements of financial institutions such as investment and retail banks, asset managers, brokers, custodians or fund administrators. Consolidation and centralization of a securities master file fed from different sources, improved management of business entities (counterparties, issuers), the creation of a unique customer master file, integration of all trades and positions in a unique repository for better risk and compliance monitoring are only a sample of the issues that NeoXam DataHub is able to address. -
8
WESL DATAHUB
Whiteland Engineering Software
WESL DATAHUB was designed over fifteen years ago out of business necessity by Whiteland Engineering Ltd., who required a software solution which would manage and control their sub-contract precision machining business. WESL DATAHUB is a fully customizable and affordable E.R.P business solution for every user from the smallest SME to the more sizable clients with both benefitting from the part user license option. WESL DATAHUB Enterprise Resource Planning (E.R.P) and Administration Software is designed to manage all aspects of your business from estimating through to accounting with the added ‘ease of use’ functionality making it both an effective and efficient business tool. WESL DATAHUB is a proficient E.R.P solution for the field of Engineering/Manufacturing and through our progressive development process it is now also able to be implemented within a broad range of other industries. -
9
Figment
Figment
Actively participating in network proposals and providing a voice to token holders in governance matters. Offering in-depth reporting of staking rewards for tax and compliance optimization. Building on Web 3 shouldn't be hard. DataHub eliminates the hassle of running your own infrastructure so that you can focus on building. View proposals and participate in on-chain governance via Hubble. View transactional and staking data updated in real-time, as well as all historical validator and staking data. Learn the basics of new protocols and discover the perfect network for your DApp. Figment operates a highly secure network of Proof-of-Stake (PoS) validators that enable token holders to secure networks, participate in governance, and earn yield. Figment’s DataHub platform lets developers use the most powerful and unique features of a blockchain without having to become protocol experts, accelerating the development of new Web 3 applications. -
10
Knoema
Knoema
Search, discover, catalog and access your data seamlessly. Knoema’s DataHub solves enterprise workflow challenges across all areas of the business by being the lens on top of any enterprise’s data assets. 8x reduction in time to value in comparison to internal build. Seamless connectivity to internal and third-party data. Fast and simple search to discover new data. Data accessibility during cloud adoption and digital transformation. Our catalog continues to grow through new datasets every day. Find 1st party, public, or 3rd party data with ease. Add new data subscriptions without the overhead. Filter across your data, your 3rd party licensed data, and new data that is pre-integrated into Knoema to get the right data for your needs. Insight and action based on unique user workflows. Foster and achieve organizational data literacy. Integrate and embed insights into other solutions. Track action and usage with data governance tools. -
11
Cogent DataHub
Skkynet
There is a growing need to securely operate on and utilize industrial data in every industry vertical. Skkynet's Cogent DataHub provides a secure-by-design industrial data operations platform that connects to, provides inline protocol conversion for, aggregates, contextualizes, edge processes, integrates with AI models, visualizes and securely streams industrial data to where ever it is needed - in OT, IT, or the cloud. Powered by patented technology and supported by decades of industrial expertise, Skkynet’s proven software is trusted by over 2,200 customers across more than 30,000 installations in 86 countries.Starting Price: $495/month - unlimited data -
12
Damoov
Damoov
Damoov provides mobile telematics as a service for teams that need to embed trip tracking, driver behavior analytics, and safe-driving scoring into mobile apps. The smartphone-based approach requires no extra hardware: the Telematics SDK captures sensor data, performs on-device preprocessing, and turns it into structured trip datasets. In the cloud, Damoov’s DataHub ingests, validates, enriches, and analyzes telematics data, while APIs deliver trips, events, and scores to your dashboards and workflows. Support configurable tracking modes (automatic, manual, on-demand, scheduled), incident detection, and risk segmentation for UBI, fleet/transportation, shared mobility, gig platforms, and driver coaching.Starting Price: $250 per month -
13
Indexima Data Hub
Indexima
Reshape your perception of time in data analytics. Instantly access your business’ data in no time and work directly on your dashboard without going back and forth with the IT team. Meet Indexima DataHub, a new space-time where operational and functional users gain instant access to their data, in no time. With a combination of its unique indexing engine and machine learning, Indexima allows businesses to access all their data to simplify and speed up analytics. Robust and scalable, the solution allows organizations to query all their data directly at the source, in volumes of tens of billions of rows in just a few milliseconds. Our Indexima platform allows users to implement instant analytics on all their data in just one click. Thanks to Indexima’s new ROI and TCO calculator, find out in 30 seconds the ROI of your data platform. Infrastructure costs, project deployment time, and data engineering costs, while boosting your analytical performances.Starting Price: $3,290 per month -
14
SkkyHub
Skkynet
For most IoT services, the cloud is an end point. With SkkyHub™, the cloud becomes a way to stream your data from wherever you have it to wherever you need it. Connect OT to IT, do M2M, or link remote locations, all streaming in real time—just microseconds over network latencies. Stream data from your devices or plants for monitoring, or stream commands, updates and configuration back to your system, or both. The DataHub gateway and ETK-enabled endpoints use the DHTP protocol to ensure a data-only connection. No VPNs means that your OT and IT networks remain untouched. Outbound connections via DHTP keep all in-bound firewall ports closed. There are no exposed attack surfaces at your facility, device, or office. Get the full picture by streaming up to 100,000 data points in real time. Three service types, Basic, Standard, and Professional, let you choose the level of service you want at a price that fits your budget.Starting Price: $99.95 per month -
15
Feedier
Alkaweb
Who likes to take a survey? No one. Feedier is a new innovative platform to collect valuable Feedback. Stay leader, turn feedback into growth leverage by making data-driven decisions to improve your services and products. Innovative forms: Deploy innovative forms in minutes with a unique model: S.I.R.A. Measure Satisfaction, collect valuable Insights, Reward to create loyalty, and finally push an Action to create engagement. Get more responses: Push highly targeted and unique feedback requests that not only deliver a much better and quicker experience but also incentivize your participants to give their opinion. Empower your data: Feedier act as data-hub. Link cross-data from your applications and services to the feedback you collect. Segment the data you require. Go one step further with sentiment analysis, thanks to machine learning analysis. A collaborative platform to infuse actions: Assign feedback in your teams, engage your participants, export your data and runStarting Price: $30.00/user/month -
16
Streaming service is a real-time, serverless, Apache Kafka-compatible event streaming platform for developers and data scientists. Streaming is tightly integrated with Oracle Cloud Infrastructure (OCI), Database, GoldenGate, and Integration Cloud. The service also provides out-of-the-box integrations for hundreds of third-party products across categories such as DevOps, databases, big data, and SaaS applications. Data engineers can easily set up and operate big data pipelines. Oracle handles all infrastructure and platform management for event streaming, including provisioning, scaling, and security patching. With the help of consumer groups, Streaming can provide state management for thousands of consumers. This helps developers easily build applications at scale.
-
17
Venturelytic
Venturelytic
Do more deals, better. Model multiple scenarios within minutes. No hassle with cap-tables and return schemes anymore. Venturelytic keeps all your records up-to-date and allows you to quickly assess the impact of different scenarios on your stake and return. Act fast on new information during negotiations with key deal information at your fingertips. Accelerate business growth and boost returns. Our analytics module helps you to evaluate both target and portfolio companies fast. Substantiate your gut feeling with insights from our datahub, that automatically gathers data on the most relevant business drivers. Dive one layer deeper and gain insight into factors determining business success. Spot opportunities quickly. Leverage the power of data from all companies tracked with Venturelytic and spot opportunities and pitfalls early. Build your own proprietary investor intelligence system that boosts returns and start monitoring investments proactively. -
18
TIBCO Streaming
TIBCO
TIBCO Streaming is a real-time analytics platform designed to process and analyze high-velocity data streams, enabling organizations to make immediate, data-driven decisions. It offers a low-code development environment through StreamBase Studio, allowing users to build complex event processing applications with minimal coding. It supports over 150 connectors, including APIs, Apache Kafka, MQTT, RabbitMQ, and databases like MySQL and JDBC, facilitating seamless integration with various data sources. TIBCO Streaming incorporates dynamic learning operators, enabling adaptive machine learning models that provide contextual insights and automate decision-making processes. It also features real-time business intelligence capabilities, allowing users to visualize live data alongside historical information for comprehensive analysis. It is cloud-ready, supporting deployments on AWS, Azure, GCP, and on-premises environments. -
19
Lenses
Lenses.io
Enable everyone to discover and observe streaming data. Sharing, documenting and cataloging your data can increase productivity by up to 95%. Then from data, build apps for production use cases. Apply a data-centric security model to cover all the gaps of open source technology, and address data privacy. Provide secure and low-code data pipeline capabilities. Eliminate all darkness and offer unparalleled observability in data and apps. Unify your data mesh and data technologies and be confident with open source in production. Lenses is the highest rated product for real-time stream analytics according to independent third party reviews. With feedback from our community and thousands of engineering hours invested, we've built features that ensure you can focus on what drives value from your real time data. Deploy and run SQL-based real time applications over any Kafka Connect or Kubernetes infrastructure including AWS EKS.Starting Price: $49 per month -
20
Hitachi Streaming Data Platform
Hitachi
The Hitachi Streaming Data Platform (SDP) is a real-time data processing system designed to analyze large volumes of time-sequenced data as it is generated. By leveraging in-memory and incremental computational processing, SDP enables swift analysis without the delays associated with traditional stored data processing. Users can define summary analysis scenarios using Continuous Query Language (CQL), similar to SQL, allowing for flexible and programmable data analysis without the need for custom applications. The platform's architecture comprises components such as development servers, data-transfer servers, data-analysis servers, and dashboard servers, facilitating scalable and efficient data processing workflows. SDP's modular design supports various data input and output formats, including text files and HTTP packets, and integrates with visualization tools like RTView for real-time monitoring. -
21
IBM StreamSets
IBM
IBM® StreamSets enables users to create and manage smart streaming data pipelines through an intuitive graphical interface, facilitating seamless data integration across hybrid and multicloud environments. This is why leading global companies rely on IBM StreamSets to support millions of data pipelines for modern analytics, intelligent applications and hybrid integration. Decrease data staleness and enable real-time data at scale—handling millions of records of data, across thousands of pipelines within seconds. Insulate data pipelines from change and unexpected shifts with drag-and-drop, prebuilt processors designed to automatically identify and adapt to data drift. Create streaming pipelines to ingest structured, semistructured or unstructured data and deliver it to a wide range of destinations.Starting Price: $1000 per month -
22
KX Streaming Analytics provides the ability to ingest, store, process, and analyze historic and time series data to make analytics, insights, and visualizations instantly available. To help ensure your applications and users are productive quickly, the platform provides the full lifecycle of data services, including query processing, tiering, migration, archiving, data protection, and scaling. Our advanced analytics and visualization tools, used widely across finance and industry, enable you to define and perform queries, calculations, aggregations, machine learning and AI on any streaming and historical data. Deployable across multiple hardware environments, data can come from real-time business events and high-volume sources including sensors, clickstreams, radio-frequency identification, GPS systems, social networking sites, and mobile devices.
-
23
Informatica Data Engineering Streaming
Informatica
AI-powered Informatica Data Engineering Streaming enables data engineers to ingest, process, and analyze real-time streaming data for actionable insights. Advanced serverless deployment option with integrated metering dashboard cuts admin overhead. Rapidly build intelligent data pipelines with CLAIRE®-powered automation, including automatic change data capture (CDC). Ingest thousands of databases and millions of files, and streaming events. Efficiently ingest databases, files, and streaming data for real-time data replication and streaming analytics. Find and inventory all data assets throughout your organization. Intelligently discover and prepare trusted data for advanced analytics and AI/ML projects. -
24
DeltaStream
DeltaStream
DeltaStream is a unified serverless stream processing platform that integrates with streaming storage services. Think about it as the compute layer on top of your streaming storage. It provides functionalities of streaming analytics(Stream processing) and streaming databases along with additional features to provide a complete platform to manage, process, secure and share streaming data. DeltaStream provides a SQL based interface where you can easily create stream processing applications such as streaming pipelines, materialized views, microservices and many more. It has a pluggable processing engine and currently uses Apache Flink as its primary stream processing engine. DeltaStream is more than just a query processing layer on top of Kafka or Kinesis. It brings relational database concepts to the data streaming world, including namespacing and role based access control enabling you to securely access, process and share your streaming data regardless of where they are stored. -
25
MaxCompute
Alibaba Cloud
MaxCompute (previously known as ODPS) is a general-purpose, fully managed, multi-tenancy data processing platform for large-scale data warehousing. MaxCompute supports various data importing solutions and distributed computing models, enabling users to effectively query massive datasets, reduce production costs, and ensure data security. Supports EB-level data storage and computing. Supports SQL, MapReduce, and Graph computational models and Message Passing Interface (MPI) iterative algorithms. Provides more efficient computing and storage services than an enterprise private cloud, and reduces the purchase cost by 20% to 30%. Provides stable offline analysis services for more than seven years, and enables multi-level sandbox protection and monitoring. MaxCompute uses tunnels to transmit data. Tunnels are scalable, and import and export PB-level data on a daily basis. You can import all data or history data through multiple tunnels. -
26
Azure Event Hubs
Microsoft
Event Hubs is a fully managed, real-time data ingestion service that’s simple, trusted, and scalable. Stream millions of events per second from any source to build dynamic data pipelines and immediately respond to business challenges. Keep processing data during emergencies using the geo-disaster recovery and geo-replication features. Integrate seamlessly with other Azure services to unlock valuable insights. Allow existing Apache Kafka clients and applications to talk to Event Hubs without any code changes—you get a managed Kafka experience without having to manage your own clusters. Experience real-time data ingestion and microbatching on the same stream. Focus on drawing insights from your data instead of managing infrastructure. Build real-time big data pipelines and respond to business challenges right away.Starting Price: $0.03 per hour -
27
Kinetica
Kinetica
A scalable cloud database for real-time analysis on large and streaming datasets. Kinetica is designed to harness modern vectorized processors to be orders of magnitude faster and more efficient for real-time spatial and temporal workloads. Track and gain intelligence from billions of moving objects in real-time. Vectorization unlocks new levels of performance for analytics on spatial and time series data at scale. Ingest and query at the same time to act on real-time events. Kinetica's lockless architecture and distributed ingestion ensures data is available to query as soon as it lands. Vectorized processing enables you to do more with less. More power allows for simpler data structures, which lead to lower storage costs, more flexibility and less time engineering your data. Vectorized processing opens the door to amazingly fast analytics and detailed visualization of moving objects at scale. -
28
Azure Data Explorer
Microsoft
Azure Data Explorer is a fast, fully managed data analytics service for real-time analysis on large volumes of data streaming from applications, websites, IoT devices, and more. Ask questions and iteratively explore data on the fly to improve products, enhance customer experiences, monitor devices, and boost operations. Quickly identify patterns, anomalies, and trends in your data. Explore new questions and get answers in minutes. Run as many queries as you need, thanks to the optimized cost structure. Explore new possibilities with your data cost-effectively. Focus on insights, not infrastructure, with the easy-to-use, fully managed data analytics service. Respond quickly to fast-flowing and rapidly changing data. Azure Data Explorer simplifies analytics from all forms of streaming data.Starting Price: $0.11 per hour -
29
BlackLynx Accelerated Analytics
BlackLynx
BlackLynx’s accelerators deliver analytics power where it’s needed and without requiring specialized skills. No matter what your analytics ecosystem includes, you can power data-driven business with powerful, easy-to-use heterogeneous computing. BlackStack software and electronics integration dramatically accelerate processing speeds for sensors deployed within ground, naval, space-based, or airborne assets. Our software enables customers to accelerate relevant AI/ML algorithms or other computing functions with a focus in the areas of real-time sensor processing; including signal detection, video sensors, missiles, radar, thermal, and other object detection capabilities. BlackStack software dramatically accelerates processing speeds for real-time data analytics. We empower our customers to probe enterprise-scale levels of unstructured and fast-changing data to collect, filter, and organize vast amounts of intelligence information or cybersecurity forensic data. -
30
Cloudera DataFlow
Cloudera
Cloudera DataFlow for the Public Cloud (CDF-PC) is a cloud-native universal data distribution service powered by Apache NiFi that lets developers connect to any data source anywhere with any structure, process it, and deliver to any destination. CDF-PC offers a flow-based low-code development paradigm that aligns best with how developers design, develop, and test data distribution pipelines. With over 400+ connectors and processors across the ecosystem of hybrid cloud services—including data lakes, lakehouses, cloud warehouses, and on-premises sources—CDF-PC provides indiscriminate data distribution. These data distribution flows can then be version-controlled into a catalog where operators can self-serve deployments to different runtimes. -
31
SQLstream
Guavus, a Thales company
SQLstream ranks #1 for IoT stream processing & analytics (ABI Research). Used by Verizon, Walmart, Cisco, & Amazon, our technology powers applications across data centers, the cloud, & the edge. Thanks to sub-ms latency, SQLstream enables live dashboards, time-critical alerts, & real-time action. Smart cities can optimize traffic light timing or reroute ambulances & fire trucks. Security systems can shut down hackers & fraudsters right away. AI / ML models, trained by streaming sensor data, can predict equipment failures. With lightning performance, up to 13M rows / sec / CPU core, companies have drastically reduced their footprint & cost. Our efficient, in-memory processing permits operations at the edge that are otherwise impossible. Acquire, prepare, analyze, & act on data in any format from any source. Create pipelines in minutes not months with StreamLab, our interactive, low-code GUI dev environment. Export SQL scripts & deploy with the flexibility of Kubernetes. -
32
Fluentd
Fluentd Project
A single, unified logging layer is key to make log data accessible and usable. However, existing tools fall short: legacy tools are not built for new cloud APIs and microservice-oriented architecture in mind and are not innovating quickly enough. Fluentd, created by Treasure Data, solves the challenges of building a unified logging layer with a modular architecture, an extensible plugin model, and a performance optimized engine. In addition to these features, Fluentd Enterprise addresses Enterprise requirements such as Trusted Packaging. Security. Certified Enterprise Connectors, Management / Monitoring, and Enterprise SLA-Based Support, Assurance, and Enterprise Consulting Services -
33
Amazon MSK
Amazon
Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can use native Apache Kafka APIs to populate data lakes, stream changes to and from databases, and power machine learning and analytics applications. Apache Kafka clusters are challenging to setup, scale, and manage in production. When you run Apache Kafka on your own, you need to provision servers, configure Apache Kafka manually, replace servers when they fail, orchestrate server patches and upgrades, architect the cluster for high availability, ensure data is durably stored and secured, setup monitoring and alarms, and carefully plan scaling events to support load changes.Starting Price: $0.0543 per hour -
34
Kapacitor
InfluxData
Kapacitor is a native data processing engine for InfluxDB 1.x and is an integrated component in the InfluxDB 2.0 platform. Kapacitor can process both stream and batch data from InfluxDB, acting on this data in real-time via its programming language TICKscript. Today’s modern applications require more than just dashboarding and operator alerts—they need the ability to trigger actions. Kapacitor’s alerting system follows a publish-subscribe design pattern. Alerts are published to topics and handlers subscribe to a topic. This pub/sub model and the ability for these to call User Defined Functions make Kapacitor very flexible to act as the control plane in your environment, performing tasks like auto-scaling, stock reordering, and IoT device control. Kapacitor provides a simple plugin architecture, or interface, that allows it to integrate with any anomaly detection engine.Starting Price: $0.002 per GB per hour -
35
Google Cloud Dataflow
Google
Unified stream and batch data processing that's serverless, fast, and cost-effective. Fully managed data processing service. Automated provisioning and management of processing resources. Horizontal autoscaling of worker resources to maximize resource utilization. OSS community-driven innovation with Apache Beam SDK. Reliable and consistent exactly-once processing. Streaming data analytics with speed. Dataflow enables fast, simplified streaming data pipeline development with lower data latency. Allow teams to focus on programming instead of managing server clusters as Dataflow’s serverless approach removes operational overhead from data engineering workloads. Allow teams to focus on programming instead of managing server clusters as Dataflow’s serverless approach removes operational overhead from data engineering workloads. Dataflow automates provisioning and management of processing resources to minimize latency and maximize utilization. -
36
IBM Streams
IBM
IBM Streams evaluates a broad range of streaming data — unstructured text, video, audio, geospatial and sensor — helping organizations spot opportunities and risks and make decisions in real-time. Make sense of your data, turning fast-moving volumes and varieties into insight with IBM® Streams. Streams evaluate a broad range of streaming data — unstructured text, video, audio, geospatial and sensor — helping organizations spot opportunities and risks as they happen. Combine Streams with other IBM Cloud Pak® for Data capabilities, built on an open, extensible architecture. Help enable data scientists to collaboratively build models to apply to stream flows, plus, analyze massive amounts of data in real-time. Acting upon your data and deriving true value is easier than ever. -
37
WarpStream
WarpStream
WarpStream is an Apache Kafka-compatible data streaming platform built directly on top of object storage, with no inter-AZ networking costs, no disks to manage, and infinitely scalable, all within your VPC. WarpStream is deployed as a stateless and auto-scaling agent binary in your VPC with no local disks to manage. Agents stream data directly to and from object storage with no buffering on local disks and no data tiering. Create new “virtual clusters” in our control plane instantly. Support different environments, teams, or projects without managing any dedicated infrastructure. WarpStream is protocol compatible with Apache Kafka, so you can keep using all your favorite tools and software. No need to rewrite your application or use a proprietary SDK. Just change the URL in your favorite Kafka client library and start streaming. Never again have to choose between reliability and your budget.Starting Price: $2,987 per month -
38
Google Cloud Pub/Sub
Google
Google Cloud Pub/Sub. Scalable, in-order message delivery with pull and push modes. Auto-scaling and auto-provisioning with support from zero to hundreds of GB/second. Independent quota and billing for publishers and subscribers. Global message routing to simplify multi-region systems. High availability made simple. Synchronous, cross-zone message replication and per-message receipt tracking ensure reliable delivery at any scale. No planning, auto-everything. Auto-scaling and auto-provisioning with no partitions eliminate planning and ensures workloads are production-ready from day one. Advanced features, built in. Filtering, dead-letter delivery, and exponential backoff without sacrificing scale help simplify your applications. A fast, reliable way to land small records at any volume, an entry point for real-time and batch pipelines feeding BigQuery, data lakes and operational databases. Use it with ETL/ELT pipelines in Dataflow. -
39
Amazon Kinesis
Amazon
Easily collect, process, and analyze video and data streams in real time. Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications. Amazon Kinesis enables you to process and analyze data as it arrives and respond instantly instead of having to wait until all your data is collected before the processing can begin. Amazon Kinesis enables you to ingest, buffer, and process streaming data in real-time, so you can derive insights in seconds or minutes instead of hours or days. -
40
Logstash
Elasticsearch
Centralize, transform & stash your data. Logstash is a free and open server-side data processing pipeline that ingests data from a multitude of sources, transforms it, and then sends it to your favorite "stash." Logstash dynamically ingests, transforms, and ships your data regardless of format or complexity. Derive structure from unstructured data with grok, decipher geo coordinates from IP addresses, anonymize or exclude sensitive fields, and ease overall processing. Data is often scattered or siloed across many systems in many formats. Logstash supports a variety of inputs that pull in events from a multitude of common sources, all at the same time. Easily ingest from your logs, metrics, web applications, data stores, and various AWS services, all in continuous, streaming fashion. Download: https://sourceforge.net/projects/logstash.mirror/ -
41
Materialize
Materialize
Materialize is a reactive database that delivers incremental view updates. We help developers easily build with streaming data using standard SQL. Materialize can connect to many different external sources of data without pre-processing. Connect directly to streaming sources like Kafka, Postgres databases, CDC, or historical sources of data like files or S3. Materialize allows you to query, join, and transform data sources in standard SQL - and presents the results as incrementally-updated Materialized views. Queries are maintained and continually updated as new data streams in. With incrementally-updated views, developers can easily build data visualizations or real-time applications. Building with streaming data can be as simple as writing a few lines of SQL.Starting Price: $0.98 per hour -
42
Redpanda
Redpanda Data
Redpanda is pioneering the Agentic Data Plane (ADP) - a new category in AI infrastructure that makes it simple and secure to connect AI agents with enterprise data and systems. Built on a multi-modal data streaming engine, Redpanda empowers agentic applications that reason and act in real-time with speed, autonomy, and precision. Global leaders including Activision Blizzard, Cisco, Moody's, Texas Instruments, Vodafone and 2 of the top 5 banks in the U.S. rely on Redpanda to process hundreds of terabytes of data a day. Backed by premier venture investors Lightspeed, GV and Haystack VC, Redpanda is a diverse, people-first organization with teams distributed around the globe. -
43
Apama
Apama
Apama Streaming Analytics allows organizations to analyze and act on IoT and fast-moving data in real-time, responding to events intelligently the moment they happen. Apama Community Edition is a freemium version of Apama by Software AG that can be used to learn about, develop and put streaming analytics applications into production. The Software AG Data & Analytics Platform is an end-toend, modular and integrated set of world-class capabilities optimized for high-speed data management and analytics on real-time data and offering out-of-the-box integration and connectivity to all key enterprise data sources. Choose the capabilities you need: streaming, predictive and visual analytics along with messaging for easy integration with other enterprise apps and an in-memory data store for extremely fast access. Integrate historical and other data for comparison—ideal when building models or enriching customer and other vital data. -
44
Oracle Stream Analytics
Oracle
Oracle Stream Analytics allows users to process and analyze large scale real-time information by using sophisticated correlation patterns, enrichment, and machine learning. It offers real-time actionable business insight on streaming data and automates action to drive today’s agile businesses. Visual GEOProcessing with GEOFence relationship spatial analytics. New Expressive Patterns Library, including Spatial, Statistical, General industry and Anomaly detection, streaming machine learning. Abstracted visual façade to interrogate live real time streaming data and perform intuitive in-memory real time business analytics. -
45
Apache Flink
Apache Software Foundation
Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams. Flink has been designed to run in all common cluster environments, perform computations at in-memory speed and at any scale. Any kind of data is produced as a stream of events. Credit card transactions, sensor measurements, machine logs, or user interactions on a website or mobile application, all of these data are generated as a stream. Apache Flink excels at processing unbounded and bounded data sets. Precise control of time and state enable Flink’s runtime to run any kind of application on unbounded streams. Bounded streams are internally processed by algorithms and data structures that are specifically designed for fixed sized data sets, yielding excellent performance. Flink is designed to work well each of the previously listed resource managers. -
46
Axual
Axual
Axual is Kafka-as-a-Service for DevOps teams. Empower your team to unlock insights and drive decisions with our intuitive Kafka platform. Axual offers the ultimate solution for enterprises looking to seamlessly integrate data streaming into their core IT infrastructure. Our all-in-one Kafka platform is designed to eliminate the need for extensive technical knowledge or skills, and provides a ready-made solution that delivers all the benefits of event streaming without the hassle. The Axual Platform is a all-in-one solution, designed to help you simplify and enhance the deployment, management, and utilization of real-time data streaming with Apache Kafka. By providing an array of features that cater to the diverse needs of modern enterprises, the Axual Platform enables organizations to harness the full potential of data streaming while minimizing complexity and operational overhead. -
47
Evam's Continuous Intelligence Platform combines multiple products for processing and visualizing real-time data. It runs real-time machine learning models on streaming data, while enriching the real-time data with a smart in-memory caching mechanism. EVAM empowers telecommunications, financial services, retail, transportation and travel companies to maximize their business value. Through continuous intelligence platform with machine learning capabilities. EVAM processes real-time data and designs and orchestrates customer journeys visually with advanced analytical models, machine learning, and artificial intelligence algorithms. EVAM enables enterprises to engage their customers using their data across all channels, including legacy ones, in real-time. Collect billions of events and process them in real-time. Understand each customer's needs and attract, engage, and retain them more effectively.
-
48
SAS Event Stream Processing
SAS Institute
Streaming data from operations, transactions, sensors and IoT devices is valuable – when it's well-understood. Event stream processing from SAS includes streaming data quality and analytics – and a vast array of SAS and open source machine learning and high-frequency analytics for connecting, deciphering, cleansing and understanding streaming data – in one solution. No matter how fast your data moves, how much data you have, or how many data sources you’re pulling from, it’s all under your control via a single, intuitive interface. You can define patterns and address scenarios from all aspects of your business, giving you the power to stay agile and tackle issues as they arise. -
49
Digital Twin Streaming Service
ScaleOut Software
ScaleOut Digital Twin Streaming Service™ Easily build and deploy real-time digital twins for streaming analytics Connect to many data sources with Azure & AWS IoT hubs, Kafka, and more Maximize situational awareness with live, aggregate analytics. Introducing a breakthrough cloud service that simultaneously tracks telemetry from millions of data sources with “real-time” digital twins — enabling immediate, deep introspection with state-tracking and highly targeted, real-time feedback for thousands of devices. A powerful UI simplifies deployment and displays aggregate analytics in real time to maximize situational awareness. Ideal for a wide range of applications, including the Internet of Things (IoT), real-time intelligent monitoring, logistics, and financial services. Simplified pricing makes getting started fast and easy. Combined with the ScaleOut Digital Twin Builder software toolkit, the ScaleOut Digital Twin Streaming Service enables the next generation in stream processing. -
50
Xeotek
Xeotek
Xeotek helps companies develop and explore their data applications and streams faster with Xeotek's powerful desktop and web application. Xeotek KaDeck was designed to be used by developers, operations, and business users alike. Because business users, developers, and operations jointly gain insight into data and processes via KaDeck, the whole team benefits: fewer misunderstandings, less rework, more transparency. Xeotek KaDeck puts you in control of your data streams. Save hours of work by gaining insights at the data and application level in projects or day-to-day operations. Export, filter, transform and manage data streams in KaDeck with ease. Run JavaScript (NodeV4) code, transform & generate test data, view & change consumer offsets, manage your streams or topics, Kafka Connect instances, schema registry, and ACLs – all from one convenient user interface.