Media Summary: Performance optimization in distributed systems is both hard and valuable. It's hard because information like task runtimes, data ... Dask is a pure Python library for parallel and distributed computing. Last year Dask parallelized NumPy and Pandas computations ... In this talk I present a survey of forms and tools that are used by practicing data journalists. I walk through examples of different ...
Plotcon 2016 Matthew Rocklin Visualizing - Detailed Analysis & Overview
Performance optimization in distributed systems is both hard and valuable. It's hard because information like task runtimes, data ... Dask is a pure Python library for parallel and distributed computing. Last year Dask parallelized NumPy and Pandas computations ... In this talk I present a survey of forms and tools that are used by practicing data journalists. I walk through examples of different ... With the availability of powerful but relatively low-level plotting libraries like d3.js, plot.ly, and matplotlib, it is easier than it has ever ... PyData New York City 2017 Slides: This talk discusses ongoing work to ... Data science can create incredible value for companies. Those that do it well, use it as a tool for strategic differentiation in the ...
As part of the eScience Institute's Guest Seminar Series, guest speaker Dask developers joined with Python web developers to make Coiled, a global scale service providing managed Dask to everyone ... At Spotify, we face the challenge of not just big data, but deep data. There's no doubt that 100 million users listening to billions of ...