Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

In Production - KubeCon [clear filter]
Thursday, December 7


Embracing Cloud Native at a Thriving, Established Company - Brian Akins, MailChimp
We are in the midst of a major shift at MailChimp. In many ways, we are a microcosm of the industry as a whole: moving from large monoliths to microservices and trying to figure out what that even means. I will discuss the hands-on, real world experiences we have had as we embrace microservice techniques and technologies. I’ll discuss why we choose Kubernetes, Prometheus, and other cloud native technologies. I’ll show our approach to building and operating multiple on premise, bare metal clusters. We’ll talk about our existing development and deployment pipeline as well as our current experimental projects. We’ve had a few false starts and failures and will discuss those to help others possibly avoid the same issues. Finally, I’ll speak candidly about the struggles we’ve had getting organizational momentum for this transformation.

avatar for Brian Akins

Brian Akins

Principal Engineer, MailChimp
Brian is a 20 year industry veteran.He has done a bit of everything - from assembly to CSS racking servers to building distributed systems. For the last few years, Brian has been focused on building and operating infrastructure using components such as containers, Kubernetes, Prometheus... Read More →

Thursday December 7, 2017 11:10am - 11:45am
Ballroom B, Level 1


Running Mixed Workloads on Kubernetes at the Institute for Health Metrics and Evaluation - Dr. Tyrone Grandison, Institute for Health Metrics and Evaluation (IHME), University of Washington
The mission of the IHME is to apply rigorous measurement and analysis to help policy makers make better decisions on a range of health policy issues. Like other organizations, the IHME have embraced containers and micro-services aggressively to better support hundreds of collaborating researchers.

In addition to containerized workloads, the IHME run a wide-variety of traditional analytic, simulation and high-performance computing workloads on an HPC cluster with 15,000 cores and 13PB of storage. Researchers increasingly need to combine both containerized and non-containerized elements into workflow pipelines, and a key challenge has been ensuring SLAs for various departments and avoiding duplicate infrastructure and unnecessary data movement and duplication. In collaboration with industry partners, IHME have deployed a unique solution based on Univa’s Navops technology that allows them to combine containerized and traditional analytic and high-performance application workloads on a single shared Kubernetes cluster, ensuring departmental SLAs and helping contain infrastructure costs.

In this talk Dr. Grandison will discuss IHME, their experience deploying containerized applications and how they went about using Kubernetes to support a variety of new containerized applications as well as a variety of traditional analytic applications.

avatar for Dr Tyrone Grandison

Dr Tyrone Grandison

Chief Information Officer, Institute for Health Metrics and Evaluation (IHME), University of Washington
Tyrone is the Chief Information Officer leading the IT team at the IHME, independent global health research center at the University of Washington. The IHME provides rigorous and comparable measurement of the world’s most important health problems and evaluates the strategies used... Read More →

Thursday December 7, 2017 11:55am - 12:30pm
Ballroom B, Level 1


Scaling to 5000+ Unique K8s Deployments, How We Did It [I] - Nicole Hubbard, WP Engine
Most organizations only need to run a couple deployments of their application in Kubernetes. In these situations, deploying onto Kubernetes clusters is relatively straightforward. What happens when you need to simultaneously deploy 5,000 unique instance of your application to different Kubernetes clusters at different providers worldwide?

Over the last year, we have worked to move over 60,000 of our customers' unique workloads from virtual machines onto Kubernetes. I will share our experiences on how to automate and simplify managing unique Kubernetes workloads at scale.

avatar for Nicole Hubbard

Nicole Hubbard

Architect, WP Engine
Nicole Hubbard is an Architect at WP Engine where she focuses on building container based infrastructure, automation and helping teams deploy their applications.

Thursday December 7, 2017 2:00pm - 2:35pm
Ballroom B, Level 1


Automating and Testing Production Ready Kubernetes Clusters in the Public Cloud - Ron Lipke, Gannet/USA Today Network
As a large enterprise organization with legacy infrastructure, we were interested in adopting Kubernetes in our internal Platform as a Service in the public cloud. However, we faced several challenges not addressed by the turn key offerings on the market, such as:

- Maintain control over network architecture within the public cloud to integrate with our internal resource
- Allow teams to easily spin up kubernetes clusters on their own for faster development cycles while retaining cost boundaries and charge-back insight
- Quickly iterate as new kubernetes versions are released and make new features available to end-users (most recently: Role Based Access Controls and StatefulSets)

We will share our experience of using configuration management to automate the testing, building and deployment of production ready cloud agnostic kubernetes clusters to the AWS and Google clouds. We will also discuss examples of moving some of our largest application workloads to these clusters.

avatar for Ron Lipke

Ron Lipke

Senior Developer, Platform as a Service, Gannet/USA Today Network
Nuclear plant operator turned cloud person

Thursday December 7, 2017 2:45pm - 3:20pm
Ballroom B, Level 1


Large Scale Teaching Infrastructure with Kubernetes - Yuvi Panda, Berkeley University

Data Science & Programming literacy is an important aspect of literacy in the 21st century, but teaching these skills at scale is quite difficult. At UC Berkeley, we are trying - our 'Foundations of Data Science' course has no pre-requisites, and routinely attracts more than a 1000 students from across majors. 

Requiring students to have local programming environments installed & debugged is a non-starter at this scale. We have been running a Kubernetes based JupyterHub environment that allows them to do all their programming with a web based environment with Jupyter Notebooks. This is an important change in many ways:

1. Lets students start instantly with writing code, rather than dealing with the accidental complexity of installing software locally

2. Acts as an equalizer - a student using a chromebook borrowed from the library has no disadvantage over someone using an expensive Macbook Pro

3. This is course critical infrastructure, and needs high availability at low human / dollar cost

In this talk we'll go over how we have:

1. Used Kubernetes to make reduce our costs while allowing a larger group of people to deploy safely to various cloud providers.

2. Extracted our JupyterHub deployment into a project part of Project Jupyter (Zero to JupyterHub) that is being adopted at other universities & organizations.


Yuvi Panda

UC Berkeley, Data Science Education Program

Thursday December 7, 2017 3:50pm - 4:25pm
Ballroom B, Level 1


101 Ways to Crash Your Cluster [I] - Marius Grigoriu & Emmanuel Gomez, Nordstrom
Running a kubernetes cluster requires operating many components. One must be good at running and scaling etcd, multiple control plane components, a monitoring system, a logging pipeline, Docker, rkt, and Linux itself. And this list isn't even close to being complete. With such a long list of technologies comes the potential to make a mistake that brings the whole cluster down. Come hear war stories from the Nordstrom's Kubernetes cluster admins. Each is a true story of how the cluster melted down, how they recovered, and what they did to prevent it from happening again. Don't let any of these happen to you...

avatar for Emmanuel Gomez

Emmanuel Gomez

Principal Engineer, Nordstrom
Emmanuel initiated and served as tech lead on the Kubernetes platform efforts at Nordstrom for the last three years. He was working with and advocating for containers before the Kubernetes 1.0 release and has continuously (and tirelessly) developed, operated, educated, and led containerization... Read More →
avatar for Marius Grigoriu

Marius Grigoriu

Sr Manager, Nordstrom
Marius Grigoriu leads the teams responsible for all of the major tools along the software delivery pipeline: issue tracking, version control, continuous integration and deployment, and production through the use of Kubernetes. His focus is to help teams ship high quality systems on... Read More →

Thursday December 7, 2017 4:35pm - 5:10pm
Ballroom B, Level 1
Friday, December 8


Moving from Mesos to Kubernetes Without Anyone Noticing [I] - Anubhav Mishra, Hootsuite
At Hootsuite, we’ve been using Mesos and Marathon as our microservices platform for over two years but last year, we made the decision to bet on Kubernetes as its replacement. Eight months later, a small team of three operations engineers had migrated our first microservice from Mesos to Kubernetes. All without developers making any code changes. This was possible by architecting our applications with the proper set of abstractions. Fast-forward three months later and we have almost 20 microservices running on Kubernetes in production.

In this session, we’ll do a live demo of migrating a service from Mesos to Kubernetes, just like how we did it at Hootsuite! We will cover why architecting your infrastructure with the “right” abstractions helps you do these huge migrations with ease and how Kubernetes already contains these abstractions. We will explore how having a service mesh helps routing between two platforms while doing the migration. Also, how a mature CI/CD pipeline can help you deploy to two platforms with ease. To conclude we will explore the differences in running a service in Mesos and Kubernetes.

avatar for Anubhav Mishra

Anubhav Mishra

Developer Advocate, HashiCorp
Anubhav Mishra is a Developer Advocate at HashiCorp. He previously worked at Hootsuite. At Hootsuite he was focused on building cloud infrastructure and distributed systems. His work spans developers and operators. He helped create the next generation microservice delivery platform... Read More →

Friday December 8, 2017 11:10am - 11:45am
Ballroom B, Level 1


Kubernetes in the Datacenter: Squarespace’s Journey Towards Self-Service Infrastructure [I] - Kevin Lynch, Squarespace
As Squarespace’s engineering organization evolved, microservices became an obvious solution to quickly deliver new features and improve infrastructure reliability. We encountered significant challenges in our transition to a microservice-based architecture. Each new service increased the operations burden to provision and maintain a growing fleet of servers, frequently slowing the process of adding new services and scaling existing services in our datacenters.

I’ll discuss how we used Kubernetes to containerize our microservice ecosystem and solve those challenges. To effectively work with ephemeral Kubernetes pods, we replaced Graphite with Prometheus and Sensu with AlertManager to monitor service health rather than individual instances. We discovered massive performance issues containerizing our Java services and worked around JVM complexities. To ease our transition from virtualization to containerization, services running inside and outside of Kubernetes must seamlessly discover each other with Consul and communicate with each other. Thanks to Calico, BGP, and our Leaf-Spine Layer 3 network topology, we efficiently route pod network traffic with the rest of our network.

avatar for Kevin Lynch

Kevin Lynch

Squarespace, Squarespace
Kevin Lynch is a Staff Engineer on the Infrastructure Engineering team at Squarespace. He focuses his efforts on eliminating the complexities of datacenters with the help of automation. He received his BSc and MSc degrees in Computer Science from Drexel University. During his time... Read More →

Friday December 8, 2017 11:55am - 12:30pm
Ballroom B, Level 1


Using Kubernetes to Change Legacy Systems and Processes in the Public Sector [B] - Audun Fauchald Strand, Norwegian Welfare Administration
Kubernetes is the implementation of the modern software development process. Continuous Release and “you built it, you run it”. For the last few years I have been working on introducing kubernetes into an organization with continuous release, microservices and “you build it, you run it”, as presented at Kubecon in Berlin 2017.

Now I work for the public sector in Norway, where the systems are old, and the processes are older. I will present the experiences from working on changing these legacy organisations, using containers and kubernetes as the main tool. I will cover:
  • migrating old legacy apps to kubernetes, is it possible
  • manual testing done easy with containers
  • monitoring for everyone
  • making a PAAS that everyone can use
  • stable and robust deployment, but not just 4 times a year
  • how to leverage all the hardware that is owned by the public sector

avatar for Audun Fauchald Strand

Audun Fauchald Strand

Team Lead - Platform and automation, NAV - Norwegian Welfare Administration
k8s, ddd, jvm, Kafka, distributed systems, testing, Tottenham. Almost called "Large viking shaped Norwegian" in LWN

Friday December 8, 2017 2:00pm - 2:35pm
Ballroom B, Level 1


From Monolith to Microservices with Kubernetes and Linkerd - Mason Jones, Credit Karma
After about eight years, Credit Karma had built up an impressive tech infrastructure...based on a PHP monolith. Over the past 18 months we’ve (carefully) adopted Docker, Linkerd, Consul, Kubernetes, and more as we shifted to microservices in order to enable continued engineering innovation. This is the story of our evolution from monolith to microservices, starting with our own homegrown tools. The talk will cover our iterations from basic plumbing to dynamic service discovery; why we started using Linkerd and selected Kubernetes; and how we evolved our systems step by step while continuing to serve 75 million members.

avatar for Mason Jones

Mason Jones

Senior Staff Infrastructure Engineer, Credit Karma
Mason is a technical leader in Credit Karma’s platform engineering department. His work has included spearheading the company's migration to microservices; integrating Kubernetes, service mesh, and Vault; and migration to public cloud. Before joining Credit Karma, Mason spent more... Read More →

Friday December 8, 2017 2:45pm - 3:20pm
Ballroom B, Level 1


Evolving and Supporting Stateful, Multi-Tenant Decisioning Applications in Production [A] - Keith Gasser, Capital One
With our adoption of Kubernetes at Capital One, we have simultaneously reduced our application delivery time-to-market while providing a common platform for streaming pipelines. We leverage Kubernetes to manage stateful decisioning applications for multiple tenants and provide a host of analytical tools as platform services to help data scientists iteratively improve decision models. We will discuss the challenges in operating these pipelines which consist of Apache Nifi canvases/flows for data ingress/egress, Kafka as persistent stream backbone, Flink for decisioning, and a number of other popular open source data analytics packages such as Apache Drill and Zeppelin forming our “Analytical Environment.”


Keith Gasser

Director, Distinguished Engineer, Capital One
Keith is a Software Engineer specializing in DevOps and Application Security at Capital One currently working on a team which has built a Kubernetes-based streaming and decisioning pipeline for Capital One Bank.

Friday December 8, 2017 3:40pm - 4:15pm
Ballroom B, Level 1


The Oregon Trail to Kubernetes [I] - Joshua Roppo, Lytics
Can a small team operating 2000 CPUs, escape the glorified bash infighting of Configuration Management to homestead the scalable compute plains of Kubernetes? A journey of transitioning from Google’s Compute Engine to the blessed Container Engine.

The route we chose diverged from the never ending landscape of single purpose YAML tutorials and retreading Configuration Management tools with templating. Instead, we chose a mountain pass of defining Kubernetes Resources as Go code for compiled type checking, composability, validation, and potential for extension. The case study of a small team breaking trail through ecosystems of application design, schedule paradigms, deprecation dysentery, and holding legacy together with bailing wire. A retrospective of value added versus time wasted on the path to great opportunities on Kubernetes.

Talk Overview: Lytics Stack and overview(whoami) Loading the Wagon: Design and decision considerations(Read the Borg Paper) Deprecation Dysentery: Wait wait don’t use that. Mirages of disappointment: Systems which couldn’t make it to Kubernetes. Compute Resource Hunting Massacre: Avoiding compute underuse; taking advantage of scheduler. Handyman’s Corner: The bailing wire and zip-tie Kubernetes tools and services built to keep the broken axle(legacy systems) intact through the journey. Blizzards of the Kubernetes: from a user’s perspective who can’t follow every SIG; surviving the avalanche of ecosystem changes. Cascadia found: the wins, plans to rebuild what was abandoned, and breathing the free air. Next: Sim City

avatar for Joshua Roppo

Joshua Roppo

Infrastructure Engineer, X
Platform Operations Engineer with a preference to write code over Bash. Managing operational decisions and transitions at Lytics for three years where we turn raw user and event data into actionable personalization APIs for marketing. Pedantic gopher who enjoys the challenges of... Read More →

Friday December 8, 2017 4:25pm - 5:00pm
Ballroom B, Level 1