Who will be speaking at Data Day Seattle?

The following speakers are confirmed for Data Day Seattle 2017. There are 50+ more to follow. Check this page regularly for updates. We are currently accepting proposals.

Tyler Akidau (Seattle) @takidau

Tyler Akidau (Linkedin) is a staff software engineer at Google Seattle. He leads technical infrastructure’s internal data processing teams (MillWheel & Flume), is a founding member of the Apache Beam PMC, and has spent the last seven years working on massive-scale data processing systems. Though deeply passionate and vocal about the capabilities and importance of stream processing, he is also a firm believer in batch and streaming as two sides of the same coin, with the real endgame for data processing systems the seamless merging between the two. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.
Tyler will be giving the following presentation: Foundations of Streaming SQL or: How I Learned to Love Stream & Table Theory

Dave Bechberger (Houston)

Dave Bechberger is a Sr. Architect at Gene by Gene, a genetic genealogy and bioinformatics company, where he works extensively on developing their next-generation data architecture. Dave has spent his career engaging in full stack software development but specializes in building data architectures in complex data domains such as bioinformatics, oil and gas, supply chain management, etc. He uses his knowledge of graph and other big data technologies to build out highly performant and scalable systems. Dave has previously spoken at a variety of international technical conferences including NDC Oslo, NDC London, and Graph Day Texas.
Dave will be speaking as part of Graph Day Seattle.

Ryan Boyd (SF Bay)

Ryan Boyd (Linkedin) is an SF-based software engineer focused on helping developers understand the power of graph databases. Previously he was a product manager for architectural software, built applications and web hosting environments for higher education, and worked in developer relations for twenty products during his 8 years at Google. He enjoys cycling, sailing, skydiving, and many other adventures when not in front of his computer.
Ryan will be giving the following Graph Day Seattle presentation: Combining graph analytics with real-time graph query workloads for solving business problems.

Nikhil Buduma (SF Bay) @nkbuduma

Nikhil Buduma (Linkedin) is Co-founder and Chief Scientist at Remedy Health, a San Francisco-based company that is building a new system for data-driven primary healthcare.
At the age of 16, he managed a drug discovery laboratory at San Jose State University and developed novel low-cost screening methodologies for resource-constrained communities. By the age of 19, he was a two-time gold medalist at the International Biology Olympiad. He later attended MIT, where he focused on developing large scale data systems to impact healthcare delivery, mental health, and medical research. At MIT, he co-founded Lean On Me, a national non-profit organization that provides an anonymous text hotline to enable effective peer support on college campuses and leverages data to effect positive mental health and wellness outcomes.
Today, Nikhil spends his free time investing in hard technology and data companies through his venture fund, Q Venture Partners, and managing a data analytics team for the Milwaukee Brewers baseball team.

Sanghamitra Deb (SF Bay) @sangha_deb

Sanghamitra Deb is a Data Scientist at Accenture Technology Laboratory. As a data scientist at Accenture she has worked on a wide variety of problems related to data modeling, architecture and visual storytelling. She has also worked in multiple data roles in different projects. Her primary focus is the application of Natural Language Processing and Machine Learning to enterprise data. She is active in Data Science outreach and believes in applying analytics to a range of domains such as pharma, HR, customer support, market research, etc. Prior to being a data scientist she was an astrophysicist who studied the structure of the universe by modeling galaxy clusters.
Sanghamitra will be speaking as part of NLP Day Seattle.

Mike Downie (Bryan/College Station)

Mike Downie, Technical Lead at Expero, is driven to solve technical problems in the most practical way possible. To do this well he developed a diverse set of skills to truly understand the problem(s) at hand and a depth of technical knowledge to create great solutions.
Lifelong interests in technology, math, and science and that drive to solve problems led Mike to pursue a Computer Science degree at Texas A&M. He wasn’t content to learn about software in a classroom; he wanted to apply concepts in real-world settings as soon as possible. During his final three years of college he worked professionally developing software, first as a co-op student for a (then) national telecommunications company and later part-time for a software consulting company. This ‘dual education’ allowed Mike to graduate with a strong academic foundation and the practical skills to apply his knowledge.
After completing his degree Mike continued to work for the consulting company, helping the company grow from 4 employees to more than 60 over the course of a decade. He gained expertise through success on a diverse set of projects. Project work included data acquisition and control, database applications and user interface design. During this period Mike also built software teams, studying software project management, estimation, requirements gathering, application lifecycle management and learning to effectively mentor junior developers.
Mike’s most recent work includes database design and front-end development for enterprise resource planning systems. He continues to look for difficult problems needing great solutions. What makes a great solution? He says first and foremost it has to work well, meeting both the functional and non-functional requirements. A solution can’t just work; to be great, it has to work fast enough, be intuitive to the user and be maintainable by the developers.
Mike will be giving the following Graph Day Seattle presentation: Graph Data Obfuscation

Garrett Eastham (Austin) @data_exhaust

Garrett Eastham is a practicing data scientist and serial entrepreneur working at the intersection of Artificial Intelligence and Digital Commerce. He has been working within enterprise ecommerce for the past 6 years - working exclusively with leading retail technology innovators such as Edgecase (which he founded) and RetailMeNot.

Dr. Denise Koessler Gosnell (Charleston) @DeniseKGosnell

In August 2017, Dr. Denise Gosnell transitioned into a Solutions Architect position with DataStax, where she aspires to build upon her experiences as a data scientist and graph architect to further their established line of graph solutions. Prior to her role with DataStax, Dr. Gosnell was a Data Scientist and Technology Evangelist at PokitDok. During her three years with PokitDok, she built software solutions for and spoke at over a dozen conferences on permissioned blockchains, machine learning applications of graph analytics, and data science within the healthcare industry.
Dr. Gosnell earned her Ph.D. in Computer Science from the University of Tennessee. Her research on how our online interactions leave behind unique identifiers that form a “social fingerprint” led to presentations at major conferences from San Diego to London and drew the interest of such tech industry giants as Microsoft Research and Apple. Additionally, she was a leader in addressing the underrepresentation of women in her field and founded a branch of Sheryl Sandberg’s Lean In Circles.
Dr. Gosnell will be speaking as part of Graph Day Seattle.

Holden Karau (San Francisco) @holdenkarau

Holden Karau is a software development engineer and is active in open source. She is a co-author of Learning Spark & Fast Data Processing with Spark and has taught intro Spark workshops. Prior to IBM she worked on a variety of big data, search, and classification problems at Alpine, DataBricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelor of Mathematics in Computer Science. Outside of computers she enjoys dancing & playing with fire.
Holden will be holding the following session: Extending Spark Machine Learning - Adding your own algorithms & tools.
While at Data Day, Holden will be holding office hours and signing copies of her O'Reilly book: High Performance Spark.

Gunnar Kleemann (Berkeley / Austin)

Gunnar Kleemann is a Data Scientist with the Berkeley Data Science Group (BDSG). He is interested in how data science facilitates biological discovery and lowers the barrier to high-throughput research, particularly in small, independent labs. In addition to his work with BDSG, he is also involved in the development and implementation of technologies like the ATX Hackerspace Biology Laboratory.
Gunnar holds a PhD in Molecular Genetics from Albert Einstein College of Medicine and a Master’s in Data Science from UC Berkeley. He did post-doctoral research on the genomics of aging at Princeton University, where his research focused on developing high-throughput robotic assays to understand how genetic changes alter lifespan and reproductive biology.
Gunnar will be speaking as part of Graph Day Seattle.

Zornitsa Kozareva (SF Bay) @zkozareva

Zornitsa Kozareva (Linkedin) is a Machine Learning Manager at Amazon Alexa, where she leads a team of natural language processing and machine learning scientists who work on developing the new capabilities for the Alexa platform and third party applications. Prior to Amazon, Zornitsa was a Senior Manager at Yahoo! leading the Query Processing group that powers Mobile Search and Advertisement. From 2009 to 2014, Dr. Kozareva was a Research Assistant Professor at the University of Southern California and a Research Scientist at the Information Sciences Institute. Her interests lie in Web-based knowledge acquisition, semantics, ontology population, multilingual information extraction and sentiment analysis. Dr. Kozareva regularly serves as Area Chair and PC of top tier NLP conferences. She has organized four SemEval scientific challenges and has published over 60 research papers. Dr. Kozareva is a recipient of the John Atanasoff Award given by the President of the Republic of Bulgaria in 2016 for her contributions and impact in science, education and industry; the Yahoo! Labs Excellence Award in 2014; and the RANLP Young Scientist Award in 2011.
Zornitsa will be speaking as part of NLP Day Seattle.

Stefan Krawczyk (San Francisco) @stefkrawczyk

Stefan Krawczyk loves the stimulus of working at the intersection of design, engineering, and data. He spent formative years at Stanford, LinkedIn, Nextdoor & Idibon, working on everything from growth engineering, product engineering, data engineering, to recommendation systems, NLP, data science and business intelligence. At Stitch Fix he’s leading development of the algorithm development platform.
Stefan will be speaking as part of NLP Day Seattle.

Denny Lee (Seattle) @dennylee

Denny Lee is a Principal Program Manager at Microsoft for the Azure CosmosDB team - Microsoft's globally distributed, multi-model database. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud environments. Prior to joining the Azure CosmosDB team, Denny worked as a Technology Evangelist at Databricks; he has been working with Apache Spark since 0.5. He was also the Senior Director of Data Sciences Engineering at Concur, and was on the incubation team that built Microsoft's Hadoop on Windows and Azure service (currently known as HDInsight). Denny also has a Masters of Biomedical Informatics from Oregon Health and Sciences University.
Denny will be co-presenting the following Graph Day Seattle presentation: Applying an active learning algorithm for entity de-duplication in graph data

William Lyon (SF Bay) @lyonwj

William Lyon is a software developer at Neo4j, the open source graph database. As an engineer on the Developer Relations team, he works primarily on integrating Neo4j with other technologies, building demo apps, helping other developers build applications with Neo4j, and writing documentation. Prior to joining Neo4j, William worked as a software developer for several startups in the real estate software, quantitative finance, and predictive API fields. William holds a Master's degree in Computer Science from the University of Montana. You can find him online at lyonwj.com.
William will be giving the following Graph Day Seattle presentation: Applying an active learning algorithm for entity de-duplication in graph data

Charity Majors (San Francisco) @mipsytipsy

Charity Majors is an engineer and cofounder at Honeycomb - a new startup focused on making impossible data observability problems not only possible, but exploratory and inviting for teams of all types. Prior to Honeycomb, Charity led the Parse infrastructure team as they grew Parse from a handful of mobile apps to over a million, then worked as an engineering manager at Facebook, while pairing closely with the RocksDB team to develop and roll out the world's first Mongo+Rocks in production. She is a reluctant DBA who has spent far too much time running Mongo, Cassandra, MySQL, Redis, and probably more but those brain cells are gone. Charity is co-author (with Laine Campbell) of the upcoming O'Reilly book Database Reliability Engineering and loves single malt scotch.
Charity's GitHub
Charity Majors will be giving the keynote for Data Day Seattle.

Yashar Mehdad (San Francisco)

Yashar Mehdad is a senior ML/NLP Scientist at Airbnb, and a frequent speaker at NLP conferences -- including EMNLP and ACL.
Yashar will be speaking as part of NLP Day Seattle.

Christian Miles (Ontario, Canada)

Christian Miles is the Technical Sales Manager at Cambridge Intelligence. Since completing his Masters in Maths & Computer Science at Bristol University in the UK, Christian has specialized in graph visualization software for global enterprise deployments. In his roles at BAE Systems and the Wynyard Group, Christian's focus has been applying graph network analysis in Financial Crime, Cyber and Law Enforcement domains. Christian is a North American Sales Engineer for Cambridge Intelligence, makers of KeyLines, and is based out of Canada.
Christian will be giving the following Graph Day Seattle presentation: Exploring the graph database landscape through graph visualization.

Ryan Mitchell (Somerville, MA) @Kludgist

Ryan Mitchell (Linkedin) is a senior software engineer at HedgeServ. She received her master's in software engineering from Harvard University Extension School, and a bachelor's in Engineering at Olin College of Engineering. Prior to joining HedgeServ, Ryan was a Software Engineer building web scrapers and bots at Abine Inc. Ryan is the author of two books about web scraping: Web Scraping with Python (O’Reilly, 2015), and Instant Web Scraping with Java (Packt, 2013), as well as an upcoming O’Reilly video series: Web Crawling with Python.

Christopher Moody (San Francisco) @chrisemoody

Chris Moody loves high-performance computing, high dimensions & high fashion. He loves learning the beautiful symmetries between physics, data, and analytics. Went to Caltech, did astrostats & supercomputing and now Data Labs at Stitch Fix. Currently enjoying coding up word2vec, Gaussian Processes, Deep RNNs and t-SNE.
Christopher will be speaking as part of NLP Day Seattle.

Jonathon Morgan (Austin) @jonathonmorgan

Jonathon Morgan (Linkedin) is Founder and CEO at New Knowledge, a company building technologies to understand and predict human behavior. As part of his ongoing work applying quantitative methods to combating violent extremism, he served as an advisor to the White House and State Department, co-authored the ISIS Twitter Census for the Brookings Institution, and develops new technology with DARPA. Jonathon is also the co-host of Partially Derivative, an unrealistically popular podcast about data science and drinking.

Jonathan Mugan (Austin) @jmugan

Jonathan Mugan (Linkedin) is Co-Founder and CEO at DeepGrammar. Dr. Mugan specializes in artificial intelligence and machine learning. His current research focuses in the area of deep learning, where he seeks to allow computers to acquire abstract representations that enable them to capture subtleties of meaning. Dr. Mugan received his Ph.D. in Computer Science from the University of Texas at Austin. His thesis was centered in developmental robotics, which is an area of research that seeks to understand how robots can learn about the world in the same way that human children do. Dr. Mugan also held a post-doctoral position at Carnegie Mellon University, where he worked at the intersection of machine learning and human-computer interaction. He is also the author of The Curiosity Cycle: Preparing Your Child for the Ongoing Technological Explosion.

Robert Munro (San Francisco) @WWRob

Robert Munro (Linkedin), VP of Machine Learning at Crowdflower, is an expert in combining Human and Machine Intelligence, working with Machine Learning approaches to Text, Speech, Image and Video Processing. Robert has founded several AI companies, building some of the top teams in Artificial Intelligence. He has worked in many diverse environments, from Sierra Leone, Haiti and the Amazon, to London, Sydney and Silicon Valley, in organizations ranging from startups to the United Nations. He most recently ran Product for AWS’s first Natural Language Processing services in the Deep Learning team at Amazon AI.
Robert has published more than 50 papers and is a regular speaker about technology in an increasingly connected world. He has a PhD from Stanford University.
Rob will be speaking as part of NLP Day Seattle.

Josh Perryman (Bryan / College Station) @joshperryman

Josh Perryman, Data Architect at Expero, likes to play with data. Oftentimes this is implementing proprietary algorithms closer to the data for performance or scale. Sometimes it is ad-hoc investigation and analysis, a sort of exploratory querying. A few times he’s been able to leverage his experience with data engines for dramatic performance improvements. But the real joy is designing a schema for both functionality and performance, one which increases the productivity of other developers and enables a technology to solve new problems or deliver new value to the business.
But technology isn't just data, and he does more than just play with data. He’s worked with high performance computing (HPC) environments, taking computations from hours to minutes or seconds. He has built visualizations which deliver new insights into complex data domains. He’s managed technology personnel, both directly and indirectly, to deliver technology solutions. He has put together more types of technology components, software and hardware, than can be counted, because one of his fortes is solving problems by building sustainable systems.
Josh will be giving the following Graph Day Seattle presentation: Securing Federated Data with TinkerPop and how to handle the “search engine” problem
Josh will also be offering a Thursday afternoon workshop: Hands-on Introduction to TinkerPop and the Gremlin Query Language.

Alan Pita (Austin)

Alan Pita, Graph Developer and Architect at Expero, has 20+ years of experience as a developer and architect for high-performance software-hardware systems scaling to multiple data centers with thousands of participating nodes each. He specializes in helping firms productize and monetize complex software technology from emerging research. He has three patents stemming from 10 years of work at IBM’s Server Division. He has led globally distributed technical teams, mined and managed agile product requirements, and built a proven track record of delivering boundary-defying technical innovations. Alan brings extensive technical experience in the areas of programming languages, complex SoC design, computer system architecture and functional design verification. Alan has a Bachelor’s Degree in Computer Science and Engineering from Texas A&M University and a Master's Degree in Computer Science from Stanford University. He is currently on hiatus from the Ph.D. program in Electrical and Computer Engineering at the University of Texas at Austin. He is also a graduate of the IBM Leadership Excellence course.
Alan will be giving the following Graph Day Seattle presentation: Interactive prototyping of Graph Applications with JanusGraph

Graph Keynote - Haikal Pribadi (London) @haikalpribadi

Haikal Pribadi is the Founder and CEO of GRAKN.AI, the database for AI. His interest in the field began at the Monash Intelligent Systems Lab, where he built an open source driver for the Parallax Eddie Robot which was then adopted by NASA. He then completed a master's degree in AI from the University of Cambridge. Haikal was also the youngest Algorithm Expert working on Quintiq’s Optimisation Technology, which powers some of the world’s largest supply chain systems in transportation, retail and logistics. He now works on GRAKN.AI, a distributed knowledge base that uses machine reasoning to handle and interpret complex data. GRAKN.AI was recently awarded Product of the Year 2017 by the University of Cambridge Computer Lab.
Haikal will be speaking as part of Graph Day Seattle.

Steve Purves

Steve Purves, Senior Software Developer at Expero, describes himself as an engineer first and foremost. He is comfortable working full-stack, cross-platform in a range of languages and is happiest when there is some mathematical or scientific analysis sprinkled in. He graduated in electrical engineering specializing in signal and image processing, which he took into the scientific computing field in the Oil and Gas industry.
During that time his work was largely split into three: development of low-level number-crunching libraries (C, C++, CUDA) and the cross-platform desktop application with 3D visualization to drive it; applied research in signal processing, numerical analysis algorithm development for 3D seismic analysis, during which he was an IEEE journal geek; and finally management of R&D and Product development teams as CTO, championing practices like TDD, BDD and Agile to get it done.
Around 5 years ago, the excitement of daily binary builds wore thin and Steve got hooked on building applications for the web, starting out with web-desktop integration work for seismic analysis on the iPad. Since then activities have included working on full-stack web applications, with and without desktop integration, for startups in sectors such as Dental, TV Production and Software Micro-Consulting.
Steve will be giving the following Graph Day Seattle presentation: Graph Representations in Machine Learning

Aravind Krishna R (Seattle) @arkramac

Aravind Krishna R is a Principal Program Manager with the Azure Cosmos DB team, Microsoft’s new globally distributed, multi-model database service. He is one of the team members responsible for building the new Gremlin graph APIs for Azure Cosmos DB. He has worked in various roles building databases and distributed systems, and has been with the Azure Cosmos DB team for the last 4 years.


Nelson Ray (SF Bay)

Nelson Ray manages the Risk Science group at Opendoor in San Francisco. His team is responsible for pricing the fee for Opendoor's home buying service and for optimizing resale strategy using a variety of machine learning models and experimental techniques. Prior to joining Opendoor, Nelson was a data scientist at Google and a software engineer at Metamarkets. He holds a BS in mathematics and an MS and PhD in statistics from Stanford University.


Julia Silge (Salt Lake City) @juliasilge

Julia Silge (LinkedIn / GitHub) is a data scientist at Stack Overflow. She enjoys making beautiful charts, the statistical programming language R, black coffee, red wine, and the mountains of her adopted home here in Utah. She has a PhD in astrophysics and an abiding love for Jane Austen. Her work involves analyzing and modeling complex data sets while communicating about technical topics with diverse audiences.
Julia will be speaking as part of NLP Day Seattle.

Shireesh Thota (SF Bay) @shireeshThota

Shireesh Thota is an Engineering Manager at Azure Cosmos DB, where he is responsible for managing the teams around the Programmability aspects. Shireesh joined Microsoft in 2007 and has worked on SQL Server, SQL DB and Cosmos DB. At Cosmos DB, his focus has primarily been around designing, evolving and engineering Logical Indexing, Query architecture, Scaling, hosting Programming runtimes and Interoperability layers. Earlier, on SQL Server, he built some of the key components around the DAC framework and the Import / Export services for SQL DB.
Shireesh will be speaking as part of Graph Day Seattle.

Denis Vrdoljak (SF Bay)

Denis Vrdoljak is Co-Founder and Managing Director at the Berkeley Data Science Group (BDSG). Denis is a Berkeley-trained Data Scientist and a Certified ScrumMaster (CSM), with a background in Project Management. He has experience working with a variety of data types, from intelligence analysis to electronics QA to business analytics. In Data Science, his passion and current focus is in Machine Learning based Predictive Analytics and Network Graph Analysis. He holds a Master's in Data Science from UC Berkeley and a Master's in International Affairs from Texas A&M.
Denis will be speaking as part of Graph Day Seattle.

Rachel Warren (San Francisco) @warre_n_peace

Rachel Warren is a Software Engineer in Data Science at Alpine Data Labs in San Francisco. She is a Spark engineer, functional programmer, and data scientist. She has worked on financial, political, and natural language problems. In addition to coding she is passionate about teaching and working with people. She has taught computer science and math in Ghana, and is now helping educate her peers on Spark in San Francisco. Rachel, along with Holden Karau, is co-author of the best-selling O’Reilly title High Performance Spark.

Trey Wilson

As a software consultant for Expero, Trey Wilson focuses on getting real code working quickly. Trey loves to work with the newest technologies, but only when they serve to achieve a pragmatic goal.
As soon as the “middle-of-nowhere” town that Trey grew up in had the Internet, he taught himself to program and hasn’t looked back since. He has programmed everything from displaying real-time geospatial communications data to image recognition using deep learning algorithms. The quickest way to get Trey interested in a problem is to say “This looks difficult.” Trey enjoys traveling to new cities to see if they have any good craft beers, reading up on esoteric subjects in case they ever become useful, and cooking like an amateur.
Trey will be giving the following Graph Day Seattle presentation: Graph Representations in Machine Learning

Dr. Yu Xu (Redwood City)

Dr. Yu Xu is the President and founder of GraphSQL. He received his Ph.D. in Computer Science and Engineering from UC San Diego. He is an expert in Big Data and Parallel Database Systems. He has 26 patents in parallel data management and optimization. He worked on Twitter’s data infrastructure for massive data analytics. Before Twitter, as Teradata’s Hadoop architect and team leader, he led Teradata’s initiatives in Big Data.
Dr. Xu will be speaking as part of Graph Day Seattle.