distributed systems – N Choose K

↧

Image may be NSFW.
Clik here to view.

Cassandra and Hadoop – Introducing the KassandraMRHelper

November 7, 2013, 10:08 am

Here at Knewton we use Cassandra for storing a variety of data. Since we follow a service-oriented architecture, many of our internal services are backed by their own data store. Some of the types of...

View Article

Image may be NSFW.
Clik here to view.

Kankoku: A Distributed Framework For Implementing Statistical Models (Part 2)

September 22, 2014, 11:40 am

The focus of my internship project this summer was to extend Kankoku (Knewton’s scientific computing framework) to operate in a more distributed fashion. There are a few reasons that drove this change...

View Article

Image may be NSFW.
Clik here to view.

Kankoku: A Distributed Framework for Implementing Statistical Models

September 22, 2014, 11:53 am

As future-facing as Knewton’s adaptive learning platform may be, the concept of a personalized classroom has a surprisingly rich history. The idea has intrigued educators and philosophers for decades....

View Article

Image may be NSFW.
Clik here to view.

How Knewton Cutover the Core of its Infrastructure from Kafka 0.7 to Kafka 0.8

September 28, 2015, 9:24 am

Kafka has been a key component of the Knewton architecture for several years now. Knewton has 17 Kafka topics consumed by 33 services. So when it came time to upgrade from Kafka 0.7 to Kafka 0.8 it was...

View Article

Image may be NSFW.
Clik here to view.

Rolling Out the Mesos Slave Roller

March 21, 2016, 7:08 am

A few months ago, Knewton started running most services via Docker containers, deployed to an Apache Mesos cluster with a Marathon scheduler. This new infrastructure makes it easy to deploy and manage...

View Article

Image may be NSFW.
Clik here to view.

Distributed Tracing: Design and Architecture

April 21, 2016, 2:18 pm

The previous blog post talked about why Knewton needed a distributed tracing system and the value it can add to a company. This section will go into more technical detail as to how we implemented our...

View Article

Distributed Tracing: Observations in Production

April 28, 2016, 2:27 pm

Previous blog posts have explained Knewton’s motivation for implementing distributed tracing, and the architecture we put together for it. At Knewton, the major consumers of tracing are ~80 engineers...

View Article

Image may be NSFW.
Clik here to view.

Digging Deep Into Cassandra Thrift Buffer Behavior

August 30, 2016, 9:50 am

Everyone who works in tech has had to debug a problem. Hopefully it is as simple as looking into a log file, but many times it is not. Sometimes the problem goes away and sometimes it only looks like...

View Article

Image may be NSFW.
Clik here to view.

Simplifying Cassandra Heap Size Allocation

September 7, 2016, 10:36 am

As discussed previously, Knewton has a large Cassandra deployment to meet its data store needs. Despite best efforts to standardize configurations across the deployment, the systems are in a...

View Article

Image may be NSFW.
Clik here to view.

Analyzing Java “Garbage First Garbage Collection” (G1GC) Logs

October 11, 2016, 12:22 pm

Garbage Collection can take a big toll on any Java application, so it’s important to understand its behavior and impact. After a JVM upgrade of Knewton’s Cassandra database, we needed a tool to compare...

View Article