Building Time On Site at Reddit
with Katie Bauer, Data Science Manager at Reddit:
Time on site is a foundational metric in web analytics & building it seems straightforward enough. But modern websites are built on the backs of distributed systems, & distributed systems make it particularly difficult to figure out when something actually happened. In this talk, we'll discuss how we implemented our own time on site metric, building ETLs with Google BigQuery & Apache Airflow, as well as the choices we made to do it, the problems we caused with those choices, & how we fixed them.
Bad Boys, Whatcha Gonna Do: Predicting Crime on the Streets of SF
with Ruqaiya Shipchandler, Solutions Engineer at Dataiku:
While San Francisco is most famous for being the technological epicenter of the world, the city's infamous past as the home of notorious criminals at Alcatraz makes us wonder: what is SF's current criminal landscape? And can we use data science to proactively fight crime in the city?
We'll share how we used Dataiku DSS to explore over 12 years of SF crime data, understand key trends, & build a predictive model that estimates the most likely category of crime for a given time & location.
About the Presenters
Katie Bauer, Data Science Manager, Reddit
Katie Bauer was a founding member of Reddit's data science team & currently manages its Consumer Data Science & Analytics team. She has previously worked in search, digital advertising, & online retail, & is known for using company hackathons as an excuse to bake cakes at work.
Ruqaiya Shipchandler, Solutions Engineer, Dataiku
Ruqaiya is a Solutions Engineer at Dataiku based in the East Bay. She works with companies to address their data science challenges & implement efficient, sustainable solutions. Ruqaiya's initial exposure to data science was through her work in the energy industry, where she had the opportunity to develop models for predictive asset maintenance, minimizing environmental impact, & improving employee safety. Ruqaiya has a degree in Chemical Engineering from the University of Houston.
About Our Partners
Dataiku is the centralized data platform that democratizes the use of data science, machine learning, & AI in the enterprise. With Dataiku, businesses are uniquely empowered to move along their data journey from data preparation to analytics at scale to Enterprise AI. By providing a common ground for data experts & explorers, a repository of best practices, shortcuts to machine learning & AI deployment/management, & a centralized, controlled environment, Dataiku is the catalyst for data-powered companies. More info at www.dataiku.com