Here’s a link to my slides from the workshop I delivered for QCon Sao Paulo, Brazil, “Real-world Cloud Big Data Patterns” Enjoy!
In this whitepaper, I take a look at the various options for Hadoop Streaming. These include Apache Storm, Apache Spark Streaming and Apache Samza. Also I examine commercial alternatives, such as Data Torrent. I cover implementation details of streaming, including type of streaming and capacities of libraries and products included. You can read this whitepaper […]
Here’s a whitepaper I wrote on the ‘state of Machine Learning’. It includes information about implementation via various cloud-based ML services (AWS, Azure, IBM) as well as category information (for architects). Your are welcome to read this whitepaper online or to download it if you prefer (linked to Slideshare source). Enjoy!
In this post I’ll summarize what I learned from running benchmark tests on virtual machines on the AWS Cloud with the Aerospike team and also as I validated their test results independently. I’ll also discuss benchmarking techniques & results for this particular set of test databases. In the process of validating benchmarks, I learned many broadly […]
I’ve been doing some work with the super fast in-memory database, Aerospike lately. See previous blog posts here about the speed of this product. Since I’ve started work w/Aerospike, the team there has announced that their core product is now open source. In this blog post, I’ll be covering how to get started developing with […]
Recently, I’ve been doing some work with AerospikeDB. It is a super-fast in-memory NoSQL Database. I gave a presentation at the recent BigDataCampLA on ‘Bleeding Edge Databases’ and included it because of impressive benchmarks, such as 1 Million TPS (read-only workload) PER SERVER and 40K TPS (read-write) on that same server. Here’s the live presentation, […]
I gave a talk called ‘Bleeding Edge Databases’ at this weekend’s BigDataCampLA. Several attendees asked me to record the talk, so I did (and will link that post below). Here’s my favorite tweet from the event. @lynnlangit #BigDataCamp "stunned by GCloud performance" and confident enough to spin […]