Who will be speaking at Data Day Seattle?

Keynote - Jonathon Morgan (Austin) @jonathonmorgan

The bio that Jonathon sent us was way too humble. Jonathon has his hands in many, many things. Most recently, Jonathon created Data For Democracy. You can read about its origins here. - Lynn

Jonathon Morgan (Linkedin) is Founder and CEO at New Knowledge, a company building technologies to understand and predict human behavior. As part of his ongoing work applying quantitative methods to combating violent extremism, he served as an advisor to the White House and State Department, co-authored the ISIS Twitter Census for the Brookings Institution, and develops new technology with DARPA. Jonathon is also the co-host of Partially Derivative, an unrealistically popular podcast about data science and drinking.
Jonathon will be giving the keynote presentation: This is Our Fight: Technology for Defending Public Discourse

Graph Keynote - Haikal Pribadi (London) @haikalpribadi

Haikal Pribadi is the Founder and CEO of GRAKN.AI, the database for AI. His interest in the field began at the Monash Intelligent Systems Lab, where he built an open source driver for the Parallax Eddie Robot which was then adopted by NASA. He then completed a master’s degree in AI from the University of Cambridge. Haikal was also the youngest Algorithm Expert behind Quintiq’s Optimisation Technology, which powers some of the world’s largest supply chain systems in transportation, retail and logistics. He now works on GRAKN.AI, a distributed knowledge base that uses machine reasoning to handle and interpret complex data. GRAKN.AI was recently awarded Product of the Year 2017 by the University of Cambridge Computer Lab.
Haikal will be giving the keynote Graph Day Seattle presentation: Databases: the past, the present, and the future in cognitive computing
Haikal will be co-presenting Tech Battle: Machine Learning vs Graphs

Tyler Akidau (Seattle) @takidau

Tyler Akidau (Linkedin) is a staff software engineer at Google Seattle. He leads technical infrastructure’s internal data processing teams (MillWheel & Flume), is a founding member of the Apache Beam PMC, and has spent the last seven years working on massive-scale data processing systems. Though deeply passionate and vocal about the capabilities and importance of stream processing, he is also a firm believer in batch and streaming as two sides of the same coin, with the real endgame for data processing systems the seamless merging between the two. He is the author of the 2015 Dataflow Model paper and the Streaming 101 and Streaming 102 articles on the O’Reilly website. His preferred mode of transportation is by cargo bike, with his two young daughters in tow.
Tyler will be giving the following presentation: Foundations of Streaming SQL or: How I Learned to Love Stream & Table Theory

John Akred (SF Bay) @BigDataAnalysis

John Akred is the Founder and CTO of Silicon Valley Data Science. In the business world, John Akred likes to help organizations become more data driven. He has over 15 years of experience in machine learning, predictive modeling, and analytical system architecture. His focus is on the intersection of data science tools and techniques; data transport, processing and storage technologies; and the data management strategy and practices that can unlock data driven capabilities for an organization. A frequent speaker at the O'Reilly Strata Conferences, John is host of the perennially popular workshop: Building A Data Platform.
John will be giving the following presentation: Machine Learning: From The Lab To The Factory

Dave Bechberger (Houston)

Dave Bechberger is a Sr. Architect at Gene by Gene, a genetic genealogy and bioinformatics company, where he works extensively on developing their next-generation data architecture. Dave has spent his career engaging in full stack software development but specializes in building data architectures in complex data domains such as bioinformatics, oil and gas, supply chain management, etc. He uses his knowledge of graph and other big data technologies to build out highly performant and scalable systems. Dave has previously spoken at a variety of international technical conferences including NDC Oslo, NDC London, and Graph Day Texas.
Dave will be giving the following Graph Day Seattle presentation: Graph Analytics - For Fun and Profit.

Luis Bosquez (Seattle) @_lbosq

Luis Bosquez (Linkedin / GitHub) is a Program Manager at Microsoft working on Azure Cosmos DB, Microsoft’s new globally distributed, multi-model database service. Luis’ previous experience includes SQL Server, Docker and multi-language open-source connectors.
Luis will be co-presenting two Graph Day Seattle sessions:
Build horizontally scalable graphs and real-time data science solutions with Azure Cosmos DB, and
A lap around Azure Cosmos DB: Microsoft's globally distributed, multi-model database

Ryan Boyd (SF Bay)

Ryan Boyd (Linkedin) is an SF-based software engineer focused on helping developers understand the power of graph databases. Previously he was a product manager for architectural software, built applications and web hosting environments for higher education, and worked in developer relations for twenty products during his 8 years at Google. He enjoys cycling, sailing, skydiving, and many other adventures when not in front of his computer.
Ryan will be giving the following Graph Day Seattle presentation: Combining graph analytics with real-time graph query workloads for solving business problems.

Sanghamitra Deb (SF Bay) @sangha_deb

Sanghamitra Deb is a Senior Data Scientist at Chegg Inc. At Chegg she works on a wide range of projects related to developing a recommendation system for Chegg online tutoring and detecting student and tutor intents using natural language processing, and is heavily involved with A/B testing machine learning models. In the past she has worked at Accenture Tech Labs developing algorithmic solutions to business problems. Prior to being a data scientist she did her PhD in astrophysics and studied the formation and evolution of the universe by analyzing gravitational lensing by galaxy clusters.
Sanghamitra will be giving the following NLP Day presentation: Evolution of Natural Language Comprehension with Human Machine Collaboration.

Mike Downie (Bryan/College Station)

Mike Downie, Technical Lead at Expero, is driven to solve technical problems in the most practical way possible. To do this well he developed a diverse set of skills to truly understand the problem(s) at hand and a depth of technical knowledge to create great solutions.
Lifelong interests in technology, math, and science and that drive to solve problems led Mike to pursue a Computer Science degree at Texas A&M. He wasn’t content to learn about software in a classroom; he wanted to apply concepts in real-world settings as soon as possible. During his final three years of college he worked professionally developing software, first as a co-op student for a (then) national telecommunications company and later part-time for a software consulting company. This ‘dual education’ allowed Mike to graduate with a strong academic foundation and the practical skills to apply his knowledge.
After completing his degree Mike continued to work for the consulting company, helping the company grow from 4 employees to more than 60 over the course of a decade. He gained expertise through success on a diverse set of projects. Project work included data acquisition and control, database applications and user interface design. During this period Mike also built software teams, studying software project management, estimation, requirements gathering, application lifecycle management and learning to effectively mentor junior developers.
Mike’s most recent work includes database design and front-end development for enterprise resource planning systems. He continues to look for difficult problems needing great solutions. What makes a great solution? He says first and foremost it has to work well, meeting both the functional and non-functional requirements. A solution can’t just work; to be great, it has to work fast enough, be intuitive to the user and be maintainable by the developers.
Mike will be giving the following Graph Day Seattle presentation: Graph Data Obfuscation

Garrett Eastham (Austin) @data_exhaust

Garrett Eastham is a practicing data scientist and serial entrepreneur working at the intersection of Artificial Intelligence and Digital Commerce. He has been working within enterprise ecommerce for the past 6 years - working exclusively with leading retail technology innovators such as Edgecase (which he founded) and RetailMeNot.
Garrett will be giving the following NLP Day presentation: Bootstrapping Knowledge-bases from Text.

David Gilardi (Orlando)

David Gilardi is a Technical Evangelist at DataStax, and is a total nerd for distributed databases - with a particular interest in distributed graph. He has over 20 years of relevant experience in programming, database administration, cloud, server/network monitoring, and analytics. Before his time at DataStax he was Senior Development Manager at Hobsons, an education services company, responsible for a flagship SaaS CRM product deployed on hybrid cloud using a combination of relational and NoSQL database technologies.
David will be presenting the following Graph Day session: 3 ways to build a near real-time recommendation engine

Dr. Denise Koessler Gosnell (Charleston) @DeniseKGosnell

In August 2017, Dr. Denise Gosnell transitioned into a Solutions Architect position with DataStax where she aspires to build upon her experiences as a data scientist and graph architect to further their established line of graph solutions. Prior to her role with DataStax, Dr. Gosnell was a Data Scientist and Technology Evangelist at PokitDok. During her three years with PokitDok, she built software solutions for and spoke at over a dozen conferences on permissioned blockchains, machine learning applications of graph analytics, and data science within the healthcare industry.
Dr. Gosnell earned her Ph.D. in Computer Science from the University of Tennessee. Her research on how our online interactions leave behind unique identifiers that form a “social fingerprint” led to presentations at major conferences from San Diego to London and drew the interest of such tech industry giants as Microsoft Research and Apple. Additionally, she was a leader in addressing the underrepresentation of women in her field and founded a branch of Sheryl Sandberg’s Lean In Circles.
Dr. Gosnell will be presenting the following Graph Day session: Everything is not a graph problem (but there are plenty)

Holden Karau (San Francisco) @holdenkarau

Holden Karau is a software development engineer and is active in open source. She is a co-author of Learning Spark & Fast Data Processing with Spark and has taught intro Spark workshops. Prior to IBM she worked on a variety of big data, search, and classification problems at Alpine, DataBricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelors of Mathematics in Computer Science. Outside of computers she enjoys dancing & playing with fire.
Holden will be holding the following session: Testing in Apache Spark 2+: avoiding the fail boat beyond RDDs.

Jason Kessler (Seattle) @jasonkessler

Jason Kessler (LinkedIn) is a lead data scientist at CDK Global, where he analyzes language use and consumer behavior in the online auto-shopping ecosystem. Prior to joining CDK, Jason was the founding data scientist at PlaceIQ and worked as a research scientist for JD Power and Associates. He has published peer-reviewed papers on algorithms and corpora for sentiment and belief analysis and has sat on program committees and reviewed for several AI and NLP conferences. Most recently, he has conducted research on identifying persuasive and influential language and the visualization of differing corpora.
Jason will be giving the following NLP Day presentation: Understanding Cultures and Perspectives through Text and Emoji Visualization.

Gunnar Kleemann (Berkeley / Austin)

Gunnar Kleemann is a Data Scientist with the Berkeley Data Science Group (BDSG). He is interested in how data science facilitates biological discovery and lowers the barrier to high-throughput research, particularly in small, independent labs. In addition to his work with BDSG, he is also involved in the development and implementation of technologies like the ATX Hackerspace Biology Laboratory.
Gunnar holds a PhD in Molecular Genetics from Albert Einstein College of Medicine and a Master’s in Data Science from UC Berkeley. He did post-doctoral research on the genomics of aging at Princeton University, where his research focused on developing high throughput robotic assays to understand how genetic changes alter lifespan and reproductive biology.
Gunnar will be speaking as part of Graph Day Seattle.

Zornitsa Kozareva (SF Bay) @zkozareva

Zornitsa Kozareva (Linkedin) is a Manager of the AWS Deep Learning group at Amazon that builds and solves natural language processing and dialog applications. Prior to joining Amazon, Dr. Kozareva was a Senior Manager at Yahoo! leading the Query Processing group that powered Mobile Search and Advertisement. From 2009 to 2014, Dr. Kozareva wore an academic hat as Research Professor at the University of Southern California CS Department with affiliation to Information Sciences Institute, where she spearheaded research funded by DARPA and IARPA on topics such as learning to read, interpreting metaphors and building knowledge bases from the Web. Dr. Kozareva regularly serves as Area Chair and PC of top-tier Natural Language Processing conferences. She has organized four SemEval scientific challenges and has published over 80 research papers. Dr. Kozareva is a recipient of the John Atanasoff Award given by the President of the Republic of Bulgaria in 2016 for her contributions and impact in science, education, and industry; the Yahoo! Labs Excellence Award in 2014; and the RANLP Young Scientist Award in 2011.
Zornitsa will be giving the following NLP Day presentation: Building Conversational Assistants with Deep Learning.

Stefan Krawczyk (San Francisco) @stefkrawczyk

Stefan Krawczyk loves the stimulus of working at the intersection of design, engineering, and data. He spent formative years at Stanford, LinkedIn, Nextdoor & Idibon, working on everything from growth engineering, product engineering, data engineering, to recommendation systems, NLP, data science and business intelligence. At Stitch Fix he’s leading development of the algorithm development platform.
Stefan will be speaking as part of NLP Day Seattle.

Denny Lee (Seattle) @dennylee

Denny Lee is a Principal Program Manager at Microsoft for the Azure CosmosDB team - Microsoft's globally distributed, multi-model database. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud environments. Prior to joining the Azure CosmosDB team, Denny worked as a Technology Evangelist at Databricks; he has been working with Apache Spark since 0.5. He was also the Senior Director of Data Sciences Engineering at Concur, and was on the incubation team that built Microsoft's Hadoop on Windows and Azure service (currently known as HDInsight). Denny also has a Masters of Biomedical Informatics from Oregon Health and Sciences University.
Denny will be co-presenting the following Graph Day Seattle presentation: Applying an active learning algorithm for entity de-duplication in graph data

William Lyon (SF Bay) @lyonwj

William Lyon is a software developer at Neo4j, the open source graph database. As an engineer on the Developer Relations team, he works primarily on integrating Neo4j with other technologies, building demo apps, helping other developers build applications with Neo4j, and writing documentation. Prior to joining Neo, William worked as a software developer for several startups in the real estate software, quantitative finance, and predictive API fields. William holds a Master's degree in Computer Science from the University of Montana. You can find him online at lyonwj.com.
William will be giving the following Graph Day Seattle presentation: Applying an active learning algorithm for entity de-duplication in graph data

Kevin Madden (SF Bay)

With more than twenty years of experience in the software industry, Kevin Madden brings an innovative approach to Tom Sawyer Software. He plays a strategic role in the direction of the company's server-based products to support the latest and emerging technologies. As Chief Software Engineer, Kevin oversees the design, creation, and execution of these products and ensures that his teams deliver robust, flexible, and easy-to-use products.
Kevin's leadership experience includes Chief Executive Officer and cofounder of Clooster, an enterprise and web search company that was acquired by Tom Sawyer Software. At Clooster, he was responsible for the overall vision, corporate structure, strategy, partnerships, and financing of the company.
Prior to Clooster, Kevin was the lead architect for several of Embarcadero Technologies' database products, including DBArtisan and RapidSQL. Kevin also led the design and development of Sun Microsystems' NetBeans UML modeling framework. He has also held senior software engineering roles at Symantec, Informix Software, and the IBM Thomas J. Watson Research Center. Kevin holds a Bachelor of Science degree in Computer Science from the State University of New York.
Kevin will be giving the following Graph Day Seattle presentation: Case Study: Visualize and Analyze the GDELT Global Knowledge Graph.

Charity Majors (San Francisco) @mipsytipsy

Charity Majors is an engineer and cofounder at Honeycomb - a new startup focused on making impossible data observability problems not only possible, but exploratory and inviting for teams of all types. Prior to Honeycomb, Charity led the Parse infrastructure team as they grew Parse from a handful of mobile apps to over a million, then worked as an engineering manager at Facebook, while pairing closely with the RocksDB team to develop and roll out the world's first Mongo+Rocks in production. She is a reluctant DBA who has spent far too much time running Mongo, Cassandra, MySQL, Redis, and probably more but those brain cells are gone. Charity is co-author (with Laine Campbell) of the upcoming O'Reilly book Database Reliability Engineering and loves single malt scotch.
Charity's GitHub

Rob McDaniel (Seattle)

Rob McDaniel is the founder of Lingistic, the machine learning team behind howbiased.com. We have focused on NLP problems related to politics, debate analysis and the detection of bias in editorials. Our goal is to help humans learn to be more critical of the material they ingest, by identifying traits and cues in the language which may be hidden or non-obvious.
Rob has a diverse background in engineering and machine learning, both with major corporations and startups. He has worked on problems related to machine translation, taxonomy classification and information extraction, and has a passion for unsupervised methods and graph theory. When not working on his startup, Rob is also the Manager of the Applied Science department at Rakuten, where he manages AI that expands the depth and quality of Rakuten's global product catalog.

Rob will be giving the following NLP Day presentation: Detecting Bias in News Articles

Yashar Mehdad (San Francisco)

Yashar Mehdad is a senior ML/NLP Scientist at Airbnb, and a frequent speaker at NLP conferences -- including EMNLP and ACL.
Yashar will be speaking as part of NLP Day Seattle.

Christian Miles (Ontario, Canada)

Christian Miles is the Technical Sales Manager at Cambridge Intelligence. Since completing his Masters in Maths & Computer Science at Bristol University in the UK, Christian has specialized in graph visualization software for global enterprise deployments. In his roles at BAE Systems and the Wynyard Group, Christian's focus has been applying graph network analysis in Financial Crime, Cyber and Law Enforcement domains. Christian is a North American Sales Engineer for Cambridge Intelligence, makers of KeyLines, and is based out of Canada.
Christian will be giving the following Graph Day Seattle presentation: Exploring the graph database landscape through graph visualization.

Jonathan Mugan (Austin) @jmugan

Jonathan Mugan (Linkedin) is a researcher specializing in artificial intelligence, machine learning, and natural language processing. His current research focuses in the area of deep learning for natural language generation and understanding. Dr. Mugan received his Ph.D. in Computer Science from the University of Texas at Austin. His thesis was centered in developmental robotics, which is an area of research that seeks to understand how robots can learn about the world in the same way that human children do. Dr. Mugan also held a post-doctoral position at Carnegie Mellon University, where he worked at the intersection of machine learning and human-computer interaction. One of the most requested speakers at the Data Day Texas conferences, he recently also spoke on the topic of NLP at the O’Reilly AI conference, and is the creator of the O’Reilly video course Natural Language Text Processing with Python. Dr. Mugan is also the author of The Curiosity Cycle: Preparing Your Child for the Ongoing Technological Explosion.
Dr. Mugan will be giving two NLP Day presentations:
From Natural Language Processing to Artificial Intelligence
Chatbots from First Principles

Josh Perryman (Bryan / College Station) @joshperryman

Josh Perryman, Data Architect at Expero, likes to play with data. Oftentimes this is implementing proprietary algorithms closer to the data for performance or scale. Sometimes it is ad-hoc investigation and analysis, a sort of exploratory querying. A few times he’s been able to leverage his experience with data engines for dramatic performance improvements. But the real joy is designing a schema for both functionality and performance, one which increases the productivity of other developers and enables a technology to solve new problems or deliver new value to the business.
But technology isn't just data, and he does more than just play with data. He’s worked with high performance computing (HPC) environments, taking computations from hours to minutes or seconds. He has built visualizations which deliver new insights into complex data domains. He’s managed technology personnel, both directly and indirectly, to deliver technology solutions. He has put together more types of technology components, software and hardware, than can be counted, because one of his fortes is solving problems by building sustainable systems.
Josh will be giving the following Graph Day Seattle presentation: Securing Federated Data with TinkerPop and how to handle the “search engine” problem
Josh will also be offering a Thursday afternoon workshop: Hands-on Introduction to TinkerPop and the Gremlin Query Language.

Alan Pita (Austin)

Alan Pita, Graph Developer and Architect at Expero, has 20+ years of experience as a developer and architect for high-performance software-hardware systems scaling to multiple data centers with thousands of participating nodes each. He specializes in helping firms productize and monetize complex software technology from emerging research. He has three patents stemming from 10 years of work at IBM’s Server Division. He has led globally distributed technical teams, mined and managed agile product requirements, and built a proven track record of delivering boundary-defying technical innovations. Alan brings extensive technical experience in the areas of programming languages, complex SoC design, computer system architecture and functional design verification. Alan has a Bachelor’s Degree in Computer Science and Engineering from Texas A&M University and a Master's Degree in Computer Science from Stanford University. He is currently on hiatus from the Ph.D. program in Electrical and Computer Engineering at the University of Texas at Austin. He is also a graduate of the IBM Leadership Excellence course.
Alan will be giving the following Graph Day Seattle presentation: Interactive prototyping of Graph Applications with JanusGraph

Aravind Krishna R (Seattle) @arkramac

Aravind Krishna R is a Principal Program Manager with the Azure Cosmos DB team, Microsoft’s new globally distributed, multi-model database service. He is one of the team members responsible for building the new Gremlin graph APIs for Azure Cosmos DB. He has worked in various roles building databases and distributed systems, and has been with the Azure Cosmos DB team for the last 4 years.

Julia Silge (Salt Lake City) @juliasilge

Julia Silge (LinkedIn / GitHub) is a data scientist at Stack Overflow. She enjoys making beautiful charts, the statistical programming language R, black coffee, red wine, and the mountains of her adopted home here in Utah. She has a PhD in astrophysics and an abiding love for Jane Austen. Her work involves analyzing and modeling complex data sets while communicating about technical topics with diverse audiences.
Julia will be giving the following NLP Day presentation: Text Mining Using Tidy Data Principles.

Joshua Shinavier (SF Bay)

Joshua Shinavier is a member of Uber’s knowledge graph team. He is a founding contributor of what is now Apache TinkerPop, and holds a PhD in Web science from Rensselaer Polytechnic Institute. A number of his open-source tools serve to bridge the gap between graph databases and the Semantic Web.
Joshua will be giving the following Graph Day Seattle presentation: Evolution of the Graph Schema

Denis Vrdoljak (SF Bay)

Denis Vrdoljak is Co-Founder and Managing Director at the Berkeley Data Science Group (BDSG). He is a Berkeley-trained Data Scientist and a Certified ScrumMaster (CSM), with a background in Project Management. He has experience working with a variety of data types, from intelligence analysis to electronics QA to business analytics. In Data Science, his passion and current focus is in Machine Learning based Predictive Analytics and Network Graph Analysis. He holds a Master's in Data Science from UC Berkeley and a Master's in International Affairs from Texas A&M.
Denis will be co-presenting two Graph Day Seattle presentations:
Tech Battle: Machine Learning vs Graphs, and
Improving the Hiring Pipeline with GraphAI

Rachel Warren (San Francisco) @warre_n_peace

Rachel Warren is a Data Scientist and Software Engineer at Salesforce Einstein where she works to build a platform to automate predictive modeling for Salesforce. She was previously at Alpine Data Labs in San Francisco. Rachel is a Spark engineer, functional programmer, and data scientist. She has worked on financial, political, and natural language problems. In addition to coding she is passionate about teaching and working with people. She has taught computer science and math in Ghana, and is now helping educate her peers on Spark in San Francisco. Rachel, along with Holden Karau, is co-author of the best-selling O’Reilly title High Performance Spark.

Trey Wilson

As a software consultant for Expero, Trey Wilson focuses on getting real code working quickly. Trey loves to work with the newest technologies, but only when they serve to achieve a pragmatic goal.
As soon as the “middle-of-nowhere” town that Trey grew up in had the Internet, he taught himself to program and hasn’t looked back since. He has programmed everything from displaying real-time geospatial communications data to image recognition using deep learning algorithms. The quickest way to get Trey interested in a problem is to say “This looks difficult.” Trey enjoys traveling to new cities to see if they have any good craft beers, reading up on esoteric subjects in case they ever become useful, and cooking like an amateur.
Trey will be giving the following Graph Day Seattle presentation: Graph Representations in Machine Learning

Dr. Yu Xu (Redwood City)

Dr. Yu Xu is the founder and CEO of TigerGraph, the world’s first native parallel graph database. Dr. Xu received his Ph.D. in Computer Science and Engineering from the University of California San Diego. He is an expert in big data and parallel database systems and has over 26 patents in parallel data management and optimization. Prior to founding TigerGraph, Dr. Xu worked on Twitter’s data infrastructure for massive data analytics. Before that, he worked as Teradata’s Hadoop architect where he led the company’s big data initiatives.
Dr. Xu will be giving the following Graph Day Seattle presentation: Scaling Deep Link Graph Analytics using Native Parallel Graph by TigerGraph