vs Big Data vs Data Mining Lately, there have been tremendous shifts in the business technology landscape. Advances in cloud technology and mobile applications have enabled businesses and IT users to interact in entirely new ways. To help you understand the various concepts in business data concepts, it is important to understand the difference between … Continue reading BI vs Big Data vs Data Mining
Month: December 2018
1. Objective We will discuss the Comparison between Hadoop 2.x vs Hadoop 3.x. What are the new features added in Hadoop version 3, is Hadoop 2 programs compatible in Hadoop 3, what are the difference between Hadoop 2 and Hadoop 3? We hope that this Feature wise difference between Hadoop 2 and Hadoop 3. will … Continue reading Comparison Between Hadoop 2.x vs Hadoop 3.x
What are the differences between traditional or RDBMS and Hadoop database systems? Both traditional relational (RDBMS) and Hadoop database systems have similar functionalities in terms of collection, storage, processing, recovery, extraction and data manipulation. However, they use radically different approaches in terms of data processing, and the problems they are trying to solve. RDBMS systems … Continue reading How to decide between RDBMS and HADOOP?
Why did we start on this path? It all starts with our customers’ hybrid data management strategy. The need to embrace the proliferation of data that is creating new opportunities for businesses to better understand their customers, their industry and their own operations. What do I mean by “proliferation?” Well, recent studies have suggested that … Continue reading How to build with IBM and MongoDB Enterprise Document Store
If you don’t know what data you have, how can you manage it effectively and generate value from it? With continued growth and a series of fast-paced bank acquisitions and mergers, BBVA Compass’s data grew to over 2.5 petabytes (PB) of data. Much of the data was spread across shared network drives and various legacy … Continue reading 5 data governance lessons from gardening
Hadoop File System was developed using distributed file system design. It is run on commodity hardware. Unlike other distributed systems, HDFS is highly faulttolerant and designed using low-cost hardware. HDFS holds very large amount of data and provides easier access. To store such huge data, the files are stored across multiple machines. These files are … Continue reading Hadoop – HDFS Overview
Time series is a sequence of observations of categorical or numeric variables indexed by a date, or timestamp. A clear example of time series data is the time series of a stock price. In the following table, we can see the basic structure of time series data. In this case the observations are recorded every … Continue reading Big Data Analytics – Time Series Analysis