Tag Archives: Map Reduce

Ordered Sets and Logs in Cassandra vs SQL


I’ve written before that Cassandra’s achilles’ heel is devops: Storage, redundancy and performance are expanded by adding more nodes. This can happen during normal business hours as long as consistency parameters are met. Same applies to node replacements. As the number of servers grows be prepared to hire a devops army or look for a […]

http://thedulinreport.com/2015/04/08/ordered-sets-and-logs-in-cassandra-vs-sql/

Ordered Sets and Logs in Cassandra vs SQL


I’ve written before that Cassandra’s achilles’ heel is devops: Storage, redundancy and performance are expanded by adding more nodes. This can happen during normal business hours as long as consistency parameters are met. Same applies to node replacements. As the number of servers grows be prepared to hire a devops army or look for a […]

http://thedulinreport.com/2015/04/08/ordered-sets-and-logs-in-cassandra-vs-sql/

Video: Getting Started with Microsoft Big Data


Introduction to Big Data

1 – In this module, you will get an overview of the Microsoft Big Data story and see how to use the Big Data Lambda architecture to describe solutions. You’ll learn how to setup a Windows Azure HD Insight cluster with storage used for the rest of the modules.

Introduction to Map Reduce

2 – In this module, you will learn the essentials of Map Reduce with Hadoop. You will see how to create a Map Reduce job using C# to process data and then run the job on an HDInsight cluster.

Introduction to Hive and HiveQL

3- In this module, you will learn how to leverage your SQL skills by using Hive and HiveQL to create tables and views and run queries on top of Hadoop data using an HDInsight cluster.

Developing Big Data Applications with NET

4 – In this module, you will learn how to use .NET Language Integrated Query (LINQ) to author Hive queries against Hadoop data. You’ll also learn how to create streaming data applications using the .NET Reactive Extensions framework.

A practical comparison of Map-Reduce in MongoDB and RavenDB


A practical comparison of Map-Reduce in MongoDB and RavenDB

Over the past week I’ve been learning about MongoDB. In my adventures with Node.js I am now at the point where I need some kind of data store.

I’ve found MongoDB fairly easy to grasp due to their excellent documentation and that I have experience with RavenDB. Therefore many of the concepts surrounding NoSQL/Document databases are already familiar to me.

I thought it would be interesting to do a practical comparison of Map-Reduce in both MongoDB and RavenDB.

To begin with I created a bunch of “blog post” documents in both database with the following structure: