Category Archives: Big Data

Whitepaper – Streaming Hadoop Solutions

In this whitepaper, I take a look at the various options for Hadoop Streaming.  These include Apache Storm, Apache Spark Streaming and Apache Samza.  Also I examine commercial alternatives, such as Data Torrent.  I cover implementation details of streaming, including … Continue reading

Posted in Big Data, Cloud, Hadoop | 1 Comment

Whitepaper – Practical Machine Learning

Here’s a whitepaper I wrote on the ‘state of Machine Learning’.  It includes information about implementation via various cloud-based ML services (AWS, Azure, IBM) as well as category information (for architects).  Your are welcome to read this whitepaper online or … Continue reading

Posted in AWS, Azure, Big Data, Cloud, Data Science | Leave a comment

Lessons Learned – Benchmarking NoSQL on the AWS Cloud (AerospikeDB and Redis)

In this post I’ll summarize what I learned from running benchmark tests on virtual machines on the AWS Cloud with the Aerospike team and also as I validated their test results independently. I’ll also discuss benchmarking techniques & results for this … Continue reading

Posted in Big Data, Cloud, noSQL | Tagged , , | Leave a comment

How to: Developing for Aerospike with Python or C#

I’ve been doing some work with the super fast in-memory database,  Aerospike lately.  See previous blog posts here about the speed of this product.  Since I’ve started work w/Aerospike, the team there has announced that their core product is now … Continue reading

Posted in Big Data, Cloud | Leave a comment

How to: Installing AerospikeDB on Google Compute Engine

Recently, I’ve been doing some work with AerospikeDB.  It is a super-fast in-memory NoSQL Database.  I gave a presentation at the recent BigDataCampLA on ‘Bleeding Edge Databases’ and included it because of impressive benchmarks, such as 1 Million TPS (read-only … Continue reading

Posted in Agile, Big Data, Cloud, google, noSQL | Leave a comment

Bleeding Edge Databases – Aerospike, Algebraix and Google Big Query

I gave a talk called ‘Bleeding Edge Databases’ at this weekend’s BigDataCampLA.  Several attendees asked me to record the talk, so I did (and will link that post below).                 Here’s my favorite … Continue reading

Posted in Big Data, Cloud, google | Leave a comment

Code Sample for D&B Business Verification API published

I’ve been doing work with Dun & Bradstreet (and am also a D&B MVP), to that end, I wrote a code sample for working with their Business Verification service API in C# and published it to GitHub. D&B’s Business Verification … Continue reading

Posted in Azure, Big Data | Tagged | Leave a comment