The Big Data List

Big Data has been the hot topic in BI over the past few years. What are the Big Data sources that you should know about? Here's a list of some major names in the Big Data space.

Big Data Infrastructure Vendors

Cloudera

A software company that provides Apache Hadoop-based software, targeting enterprise-class Hadoop deployments.

Cloudera contains the main, core elements of Hadoop, providing reliable, scalable distributed data processing of large data sets (chiefly MapReduce and HDFS), as well as other enterprise-oriented components that provide security, high availability, and integration with hardware and other software.

SAP HANA

HANA is an in-memory, column-oriented, relational database management system developed and marketed by SAP AG. HANA is an acronym for "High-Performance Analytic Appliance" based on in-memory technology, enabling Big Data analysis to be performed at faster speeds.

Hortonworks

Started by funding from Yahoo! as an independent company, Hortonworks provides an enterprise-ready version of Hadoop. A Microsoft and Yahoo! partner.

MapR

MapR provides a commercial distribution of Hadoop as an alternative to Cloudera and Hortonworks.

#1 Ranking: Read how InetSoft was rated #1 for user adoption in G2's user survey-based index Read More

machine learning reporting dashboard example

Easy to Use Business Intelligence Tool Example

Big Data Terms

MapReduce

Created by Google, MapReduce is an infrastructure or framework for storing massive amounts of data, that orchestrates by marshalling distributed servers, running the various tasks in parallel, and managing all communications and data transfers between the various parts of the system. The model is inspired by the map and reduce functions commonly used in functional programming.

The map procedure performs filtering and sorting (such as sorting students by first name into queues, one queue for each name) and the reduce procedure performs a summary operation (such as counting the number of students in each queue, yielding name frequencies).

Hadoop

An open-source framework for large-scale data processing and storage on clusters of servers. Hadoop was created by Apache in 2005, and is based on the MapReduce paradigm.

Hadoop is considered to be an ideal environment for extracting and transforming huge volumes of data. Also, Hadoop is known for providing a scalable, reliable and distributed processing environment.

InetSoft for Big Data Analytics and Dashboarding

What do all of these Big Data sources have in common? They can all be accessed and integrated with other sources using InetSoft's Style Intelligence.

Holding to the open source standards, InetSoft's solution can access all major Big Data sources, as well as a broad array of more traditional sources. Style Intelligence's robust data mashup engine extracts data from all sources and allows non-technical users to create their own data mashups using a simple drag-and-drop tool.

Once mashups are created, the data grid cache pulls the data necessary for reports or analysis and stores it in-memory, enabling analytics to be performed on Big Data sources at very high speed.

To unleash the potential of Big Data, consider InetSoft for your dashboarding and reporting needs.

About InetSoft

Since 1996 InetSoft has been delivering easy, agile, and robust business intelligence software that makes it possible for organizations and solution providers of all sizes to deploy or embed full-featured business intelligence solutions.

Application highlights include visually-compelling and interactive dashboards that ensure greater end-user adoption plus pixel-perfect report generation, scheduling, and bursting. InetSoft's patent pending Data Block™ technology enables productive reuse of queries and a unique capability for end-user defined data mashup. This capability combined with efficient information access enabled by InetSoft's visual analysis technologies allows maximum self-service that benefits the average business user, the IT administrator, and the developer.

InetSoft solutions have been deployed at over 5,000 organizations worldwide, including 25% of Fortune 500 companies, spanning all types of industries.

Big Data Software Customers