Who will be speaking at Data Day Seattle?

The following speakers are confirmed for Data Day Seattle 2017. There are 50+ more to follow. Check this page regularly for updates. We are currently accepting proposals.

Tyler Akidau (Seattle) @takidau

Tyler Akidau (Linkedin) is a staff software engineer at Google Seattle. He leads technical infrastructure’s internal data processing teams (MillWheel & Flume), is a founding member of the Apache Beam PMC, and has spent the last seven years working on massive-scale data processing systems. Though deeply passionate and vocal about the capabilities and importance of stream processing, he is also a firm believer in batch and streaming as two sides of the same coin, with the real endgame for data processing systems the seamless merging between the two. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.
Tyler will be giving the following presentation: Foundations of Streaming SQL or: How I Learned to Love Stream & Table Theory

(NEW) - Dave Bechberger (Houston)

Dave Bechberger is a Sr. Architect at Gene by Gene, a genetic genealogy and bioinformatics company, where he works extensively on developing their next-generation data architecture. Dave has spent his career engaging in full stack software development but specializes in building data architectures in complex data domains such as bioinformatics, oil and gas, supply chain management, etc. He uses his knowledge of graph and other big data technologies to build out highly performant and scalable systems. Dave has previously spoken at a variety of international technical conferences including NDC Oslo, NDC London, and Graph Day Texas.

Nikhil Buduma (SF Bay) @nkbuduma

Nikhil Buduma (Linkedin) is Co-founder and Chief Scientist at Remedy Health, a San Francisco-based company that is building a new system for data-driven primary healthcare.
At the age of 16, he managed a drug discovery laboratory at San Jose State University and developed novel low-cost screening methodologies for resource-constrained communities. By the age of 19, he was a two-time gold medalist at the International Biology Olympiad. He later attended MIT, where he focused on developing large scale data systems to impact healthcare delivery, mental health, and medical research. At MIT, he co-founded Lean On Me, a national non-profit organization that provides an anonymous text hotline to enable effective peer support on college campuses and leverages data to effect positive mental health and wellness outcomes.
Today, Nikhil spends his free time investing in hard technology and data companies through his venture fund, Q Venture Partners, and managing a data analytics team for the Milwaukee Brewers baseball team.

Sanghamitra Deb (SF Bay) @sangha_deb

Sanghamitra Deb is a Data Scientist at Accenture Technology Laboratory. As a data scientist at Accenture she has worked on a wide variety of problems related to data modeling, architecture and visual storytelling. She has also worked in multiple data roles in different projects. Her primary focus is the application of Natural Language Processing and Machine Learning to enterprise data. She is active in Data Science outreach and believes in applying analytics to a range of domains such as pharma, HR, customer support, market research, etc. Prior to becoming a data scientist she was an astrophysicist who studied the structure of the universe by modeling galaxy clusters.
Sanghamitra will be speaking as part of NLP Day Seattle.

Mike Downie (Bryan/College Station)

Mike Downie, Technical Lead at Expero, is driven to solve technical problems in the most practical way possible. To do this well he developed a diverse set of skills to truly understand the problem(s) at hand and a depth of technical knowledge to create great solutions.
Lifelong interests in technology, math, and science and that drive to solve problems led Mike to pursue a Computer Science degree at Texas A&M. He wasn’t content to learn about software in a classroom; he wanted to apply concepts in real-world settings as soon as possible. During his final three years of college he worked professionally developing software, first as a co-op student for a (then) national telecommunications company and later part-time for a software consulting company. This ‘dual education’ allowed Mike to graduate with a strong academic foundation and the practical skills to apply his knowledge.
After completing his degree Mike continued to work for the consulting company, helping the company grow from 4 employees to more than 60 over the course of a decade. He gained expertise through success on a diverse set of projects. Project work included data acquisition and control, database applications and user interface design. During this period Mike also built software teams, studying software project management, estimation, requirements gathering, application lifecycle management and learning to effectively mentor junior developers.
Mike’s most recent work includes database design and front-end development for enterprise resource planning systems. He continues to look for difficult problems needing great solutions. What makes a great solution? He says first and foremost it has to work well, meeting both the functional and non-functional requirements. A solution can’t just work; to be great, it has to work fast enough, be intuitive to the user, and be maintainable by the developers.
Mike will be giving the following Graph Day Seattle presentation: Graph Data Obfuscation

Garrett Eastham (Austin) @data_exhaust

Garrett Eastham is a practicing data scientist and serial entrepreneur working at the intersection of Artificial Intelligence and Digital Commerce. He has been working within enterprise ecommerce for the past 6 years - working exclusively with leading retail technology innovators such as Edgecase (which he founded) and RetailMeNot.

Holden Karau (San Francisco) @holdenkarau

Holden Karau is a software development engineer and is active in open source. She is a co-author of Learning Spark & Fast Data Processing with Spark and has taught intro Spark workshops. Prior to IBM she worked on a variety of big data, search, and classification problems at Alpine, DataBricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelor of Mathematics in Computer Science. Outside of computers she enjoys dancing & playing with fire.
Holden will be holding the following session: Extending Spark Machine Learning - Adding your own algorithms & tools.
While at Data Day, Holden will be holding office hours and signing copies of her O'Reilly book: High Performance Spark.

Jason Kessler (Seattle) @jasonkessler

Jason Kessler (LinkedIn) is a data scientist at CDK Global, where he analyses language use and consumer behavior in the online auto-shopping ecosystem. Prior to joining CDK, Jason was the founding data scientist at PlaceIQ and worked as a research scientist for JD Power and Associates. He has published peer-reviewed papers on algorithms and corpora for sentiment and belief analysis, and has sat on program committees and reviewed for several AI and NLP conferences. Most recently, he has delivered talks on the identification of persuasive and influential language at the 2015 Sentiment Symposium and Data Day Seattle 2016.
Jason will be speaking as part of NLP Day Seattle.

Zornitsa Kozareva (SF Bay) @zkozareva

Zornitsa Kozareva (Linkedin) is a Machine Learning Manager at Amazon Alexa, where she leads a team of natural language processing and machine learning scientists who work on developing new capabilities for the Alexa platform and third party applications. Prior to Amazon, Zornitsa was a Senior Manager at Yahoo! leading the Query Processing group that powers Mobile Search and Advertisement. From 2009 to 2014, Dr. Kozareva was a Research Assistant Professor at the University of Southern California and a Research Scientist at the Information Sciences Institute. Her interests lie in Web-based knowledge acquisition, semantics, ontology population, multilingual information extraction and sentiment analysis. Dr. Kozareva regularly serves as Area Chair and PC member of top tier NLP conferences. She has organized four SemEval scientific challenges and has published over 60 research papers. Dr. Kozareva is a recipient of the John Atanasoff Award, given by the President of the Republic of Bulgaria in 2016 for her contributions and impact in science, education and industry; the Yahoo! Labs Excellence Award in 2014; and the RANLP Young Scientist Award in 2011.
Zornitsa will be speaking as part of NLP Day Seattle.

Stefan Krawczyk (San Francisco) @stefkrawczyk

Stefan Krawczyk loves the stimulus of working at the intersection of design, engineering, and data. He spent formative years at Stanford, LinkedIn, Nextdoor & Idibon, working on everything from growth engineering, product engineering, data engineering, to recommendation systems, NLP, data science and business intelligence. At Stitch Fix he’s leading development of the algorithm development platform.
Stefan will be speaking as part of NLP Day Seattle.

 

Yashar Mehdad (San Francisco)

Yashar Mehdad is a senior ML/NLP Scientist at Airbnb, and a frequent speaker at NLP conferences -- including EMNLP and ACL.
Yashar will be speaking as part of NLP Day Seattle.

Ryan Mitchell (Somerville, MA) @Kludgist

Ryan Mitchell (Linkedin) is a senior software engineer at HedgeServ. She received her master's in software engineering from Harvard University Extension School, and a bachelor's in Engineering from Olin College of Engineering. Prior to joining HedgeServ, Ryan was a Software Engineer building web scrapers and bots at Abine Inc. Ryan is the author of two books about web scraping: Web Scraping with Python (O’Reilly, 2015) and Instant Web Scraping with Java (Packt, 2013), as well as an upcoming O’Reilly video series: Web Crawling with Python.

Christopher Moody (San Francisco) @chrisemoody

Chris Moody loves high-performance computing, high dimensions & high fashion. He loves learning the beautiful symmetries between physics, data, and analytics. Went to Caltech, did astrostats & supercomputing and now Data Labs at Stitch Fix. Currently enjoying coding up word2vec, Gaussian Processes, Deep RNNs and t-SNE.
Christopher will be speaking as part of NLP Day Seattle.

Jonathon Morgan (Austin) @jonathonmorgan

Jonathon Morgan (Linkedin) is Founder and CEO at New Knowledge, a company building technologies to understand and predict human behavior. As part of his ongoing work applying quantitative methods to combating violent extremism, he served as an advisor to the White House and State Department, co-authored the ISIS Twitter Census for the Brookings Institution, and develops new technology with DARPA. Jonathon is also the co-host of Partially Derivative, an unrealistically popular podcast about data science and drinking.

Jonathan Mugan (Austin) @jmugan

Jonathan Mugan (Linkedin) is Co-Founder and CEO at DeepGrammar. Dr. Mugan specializes in artificial intelligence and machine learning. His current research focuses in the area of deep learning, where he seeks to allow computers to acquire abstract representations that enable them to capture subtleties of meaning. Dr. Mugan received his Ph.D. in Computer Science from the University of Texas at Austin. His thesis was centered in developmental robotics, which is an area of research that seeks to understand how robots can learn about the world in the same way that human children do. Dr. Mugan also held a post-doctoral position at Carnegie Mellon University, where he worked at the intersection of machine learning and human-computer interaction. He is also the author of The Curiosity Cycle: Preparing Your Child for the Ongoing Technological Explosion.

Robert Munro (San Francisco) @WWRob

Robert Munro (Linkedin), VP of Machine Learning at Crowdflower, is an expert in combining Human and Machine Intelligence, working with Machine Learning approaches to Text, Speech, Image and Video Processing. Robert has founded several AI companies, building some of the top teams in Artificial Intelligence. He has worked in many diverse environments, from Sierra Leone, Haiti and the Amazon, to London, Sydney and Silicon Valley, in organizations ranging from startups to the United Nations. He most recently ran Product for AWS’s first Natural Language Processing services in the Deep Learning team at Amazon AI.
Robert has published more than 50 papers and is a regular speaker about technology in an increasingly connected world. He has a PhD from Stanford University.
Rob will be speaking as part of NLP Day Seattle.

Josh Perryman (Bryan / College Station) @joshperryman

Josh Perryman, Data Architect at Expero, likes to play with data. Oftentimes this is implementing proprietary algorithms closer to the data for performance or scale. Sometimes it is ad-hoc investigation and analysis, a sort of exploratory querying. A few times he’s been able to leverage his experience with data engines for dramatic performance improvements. But the real joy is designing a schema for both functionality and performance, one which increases the productivity of other developers and enables a technology to solve new problems or deliver new value to the business.
But technology isn't just data, and he does more than just play with data. He’s worked with high performance computing (HPC) environments, taking computations from hours to minutes or seconds. He has built visualizations which deliver new insights into complex data domains. He’s managed technology personnel, both directly and indirectly, to deliver technology solutions. He’s put together more types of technology components, software and hardware, than can be counted, because one of his fortes is solving problems by building sustainable systems.
Josh will be giving the following Graph Day Seattle presentation: Securing Federated Data with TinkerPop and how to handle the “search engine” problem
Josh will also be offering a Thursday afternoon workshop: Hands-on Introduction to TinkerPop and the Gremlin Query Language.

Alan Pita (Austin)

Alan Pita, Graph Developer and Architect at Expero, has 20+ years of experience as a developer and architect for high-performance software-hardware systems scaling to multiple data centers with thousands of participating nodes each. He specializes in helping firms productize and monetize complex software technology from emerging research. He has three patents stemming from 10 years of work at IBM’s Server Division. He has led globally distributed technical teams, mined and managed agile product requirements, and built a proven track record of delivering boundary-defying technical innovations. Alan brings extensive technical experience in the areas of programming languages, complex SoC design, computer system architecture and functional design verification. Alan has a Bachelor’s Degree in Computer Science and Engineering from Texas A&M University and a Master's Degree in Computer Science from Stanford University. He is currently on hiatus from the Ph.D. program in Electrical and Computer Engineering at the University of Texas at Austin. He is also a graduate of the IBM Leadership Excellence course.
Alan will be giving the following Graph Day Seattle presentation: Interactive prototyping of Graph Applications with JanusGraph

Haikal Pribadi (London) @haikalpribadi

Haikal Pribadi is the Founder and CEO of GRAKN.AI, the database for AI. His interest in the field began at the Monash Intelligent Systems Lab, where he built an open source driver for the Parallax Eddie Robot which was then adopted by NASA. He then completed a master's degree in AI at the University of Cambridge. Haikal was also the youngest Algorithm Expert behind Quintiq’s Optimisation Technology, which powers some of the world’s largest supply chain systems in transportation, retail and logistics. He now works on GRAKN.AI, a distributed knowledge base that uses machine reasoning to handle and interpret complex data. GRAKN.AI was recently awarded Product of the Year 2017 by the University of Cambridge Computer Lab.
Haikal will be speaking as part of Graph Day Seattle.

Steve Purves

Steve Purves, Senior Software Developer at Expero, describes himself as an engineer first and foremost. He is comfortable working full-stack, cross-platform in a range of languages and is happiest when there is some mathematical or scientific analysis sprinkled in. He graduated in electrical engineering specializing in signal and image processing, which he took into the scientific computing field in the Oil and Gas industry.
During that time his work was largely split into three: development of low-level number-crunching libraries (C, C++, CUDA) and the cross-platform desktop application with 3D visualization to drive it; applied research in signal processing, numerical analysis algorithm development for 3D seismic analysis, during which he was an IEEE journal geek; and finally management of R&D and Product development teams as CTO, championing practices like TDD, BDD and Agile to get it done.
Around 5 years ago, the excitement of daily binary builds wore thin and Steve got hooked on building applications for the web, starting out with web-desktop integration work for seismic analysis on the iPad. Since then activities have included working on full-stack web applications, with and without desktop integration, for startups in sectors such as Dental, TV Production and Software Micro-Consulting.
Steve will be giving the following Graph Day Seattle presentation: Graph Representations in Machine Learning

Nelson Ray (SF Bay)

Nelson Ray manages the Risk Science group at Opendoor in San Francisco. His team is responsible for pricing the fee for Opendoor's home buying service and for optimizing resale strategy using a variety of machine learning models and experimental techniques. Prior to joining Opendoor, Nelson was a data scientist at Google and a software engineer at Metamarkets. He holds a BS in mathematics and an MS and PhD in statistics from Stanford University.

 

Julia Silge (Salt Lake City) @juliasilge

Julia Silge (LinkedIn / GitHub) is a data scientist at Stack Overflow. She enjoys making beautiful charts, the statistical programming language R, black coffee, red wine, and the mountains of her adopted home here in Utah. She has a PhD in astrophysics and an abiding love for Jane Austen. Her work involves analyzing and modeling complex data sets while communicating about technical topics with diverse audiences.
Julia will be speaking as part of NLP Day Seattle.

Trey Wilson

As a software consultant for Expero, Trey Wilson focuses on getting real code working quickly. Trey loves to work with the newest technologies, but only when they serve to achieve a pragmatic goal.
As soon as the “middle-of-nowhere” town that Trey grew up in had the Internet, he taught himself to program and hasn’t looked back since. He has programmed everything from displaying real-time geospatial communications data to image recognition using deep learning algorithms. The quickest way to get Trey interested in a problem is to say “This looks difficult.” Trey enjoys traveling to new cities to see if they have any good craft beers, reading up on esoteric subjects in case they ever become useful, and cooking like an amateur.
Trey will be giving the following Graph Day Seattle presentation: Graph Representations in Machine Learning

Dr. Yu Xu (Redwood City)

Dr. Yu Xu is the President and founder of GraphSQL. He received his Ph.D. in Computer Science and Engineering from UC San Diego. He is an expert in Big Data and Parallel Database Systems. He has 26 patents in parallel data management and optimization. He worked on Twitter’s data infrastructure for massive data analytics. Before Twitter, as Teradata’s Hadoop architect and team leader, he led Teradata’s initiatives in Big Data.
Dr. Xu will be speaking as part of Graph Day Seattle.