Uncategorized

4 Million TPS GCP with Aerospike

Aerospike Whitesands Tool
I did some work with the Aerospike team and some other partners (@dchaley and @jamesrcounts) to validate Aerospike performance benchmarks on the Google Cloud using GCE instances.

In addition to blogging about the relatively simple 6-step process of setting up a 20-node cluster to get this mind-boggling performance, my team also wrote some scripts so that you can easily replicate our work.

Also, I recorded a screencast about Aerospike which includes a live demo of the performance benchmark – guess what? we actually got an even HIGHER benchmark tonight – between 5 and 6 MILLION TPS for read-only workloads. We also added a test for a mixed workload – 50% read/50% write. We got over 1 MILLION TPS for both the reads and the writes using the same size cluster – BAM!

Uncategorized

Redshift Data Warehouse w/Matillion

Building a Data Warehouse on AWS
I led a great team at this year’s AWS re:Invent conference in building a workshop for attendees. We took on the daunting task of creating courseware for teams of students to build an end-to-end data warehouse in just two hours. Happily, all teams were successful!

So, how did we do it? We used AWS:Marketplace partners to ‘speed up’ our time-to-value. Specifically, we used Matillion ETL for Redshift to load and transform our data. Then we used Tableau to create a dashboard.

Want to know more?

I’ve posted our session notes / setup on slideshare for you to review.
Also, I’ve posted a setup guide on GitHub. This includes AWS cli commands for you to use if you wish to duplicate this exercise yourself.

Also, I’m part of a new site that AWS launched to help you to understand exactly what selected AWS:Marketplace Big Data partners have to offers. Here you’ll find interviews with technical leads from these companies, where we discuss what exactly their product is and does, architectural patterns, common use case and also customer success stories. Content is targeted at technical architects.

How do you use AWS Redshift? Which AWS:Marketplace Big Data partners have you explored? I’d love to hear from you in the comments section below.

Uncategorized

TKP Courseware Influences

We often get asked ‘what are the influences’ for TKP courseware? TKP courseware includes TKPJava, TKPSmallBasic and new courseware around Data Science and IoT concepts.

In addition to to the work of the TKP team that has created TKPJava courseware, the team is inspired by many other influences.  These influences are varied and many (and listed below), in particular the ideas in this book inspire many of our lesson concepts:

TurtleGeometry

Uncategorized

Learn GitHub – Screencast series

I struggled through various aspects of GitHub when I first started using it about 2 years ago.  Now it’s become part of my daily routine.  Lately, I’ve gotten more and more requests to answer “What am I sure is a stupid question” from friends.  To that end, I decided to create a short screencast series in which I will show the practicalities of using GitHub.

Commit Yourself and Enjoy!

Part 1 – What is GitHub?

Part 2 – Getting Started with GitHub – Users and Repositories

Part 3 – Working with Repositories – working with push, pull and clone; also performing basic commits to both local and remote repos

Part 4 – Handling Conflict – working with repo branches and forks; performing merges

Part 5 – Bonus – understanding and using data from your Repositories