UPDATED 17:22 EST / JULY 20 2017


A game of data science: the analytics architecture behind Riot Games

With the help of modern analytics, Riot Games Inc. developed a highly successful computer game called League of Legends, in which players form teams of champions and compete with other players around the world. Wesley Kerr (pictured), senior data scientist at Riot Games, explained how his organization is leveraging data science to improve player experience and weed out unsavory behavior.

“[In] about 2 percent of our games there is some form of serious abuse that comes in the form of hate speech, racism and sexism, things that have no place in the game.” Kerr said. “Right now it’s purely based on things said in chat, but we’re investigating other ways of measuring that behavior.”

Kerr gave a keynote speech at this year’s Spark Summit in San Francisco, California, and afterwards spoke with David Goad (@davidgoad) and George Gilbert (@ggilbert41), co-hosts of theCUBE, SiliconANGLE Media’s mobile live streaming studio, to dive into more detail about Riot Game’s data science stack. (* Disclosure below.)

A DataBricks power player experience engine

Kerr described what is under the hood at Riot Game’s data science organization. “We rely on DataBricks for all of our deployments. We do many different clusters and have about 14 different data scientists that work with us. Each one is able to manage their own cluster, spin them up tear them down, find their data and work with it through DataBricks,” Kerr explained.

Kerr went on to explain the configuration of the data warehouse itself and how they manage the sheer scale of data being processed.

“We’re able to leverage the power of our players; we have 100 million. … All the data flows into a hive data warehouse stored in S3. We have two different ways of interacting with it. We can run queries against Hive, which tends to be a little slower for our use cases. Our data scientists tend to access to all of that data through DataBricks and Spark, which runs much quicker for our use cases.”

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of Spark Summit 2017(* Disclosure: DataBricks Inc. sponsored this Spark Summit 2017 segment on SiliconANGLE Media’s theCUBE. Neither DataBricks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy