An Introduction to Object Oriented Data Science in Python
Python, Tools & Languages; posted by megan@odsc.com, August 31, 2016
A lot of focus in the data science community is on reducing the complexity and time involved in data gathering, cleaning, and organization. This article discusses how object-oriented design techniques from software engineering can be used to reduce coding overhead and create robust, reusable data...
Integrating Python and R
Blog, Python, R, Tools & Languages; Jupyter|Python|R|rstudio; posted by Gordon Fleetwood, April 19, 2016
Over time, Python and R have established themselves as the leading languages for Data Science. The rise of both has not been frictionless, though, as the two communities have ‘clashed’ over philosophical differences as each side recruits Data Science newcomers. R users will recommend that R...
Top 5 Tools and Languages for Data Science
Blog, Data Visualization, Data Wrangling, Modeling, Python, R, Statistics, Tools, Tools & Languages; Data Science|Python|R|SQL; posted by Jason O'Rawe, ODSC Data Science Team, April 18, 2016
Data science is an interdisciplinary endeavor, and it serves the purpose of extracting insight from varying sources of information. Various communities come together at Data Science Conferences to share their knowledge and promote innovation. It is not surprising, then, that the tools showcased by data scientists...
Jupyter, Zeppelin, Beaker: The Rise of the Notebooks
Platforms, Python, Tools, Tools & Languages, Workflow; Data Science|Jupyter|Notebooks; posted by Alex Perrier, December 3, 2015
Standard software development practices for web, SaaS, and industrial environments tend to focus on maintainability, code quality, robustness, and performance. Scientific programming in data science is more concerned with exploration, experimentation, making demos, collaborating, and sharing results. It is this very need for experiments, explorations, and...
Riding on Large Data with Scikit-learn
Data Wrangling, Modeling, Platforms, Python, Statistics, Tools, Tools & Languages; Data Mining|Machine Learning|Python|Scikit-learn; posted by Alex Perrier, October 28, 2015
What’s a Large Data Set? A data set is said to be large when it exceeds 20% of the available RAM for a single machine, which for your standard MacBook Pro with 8 GB of RAM corresponds to a meager dataset of roughly 2 GB, a size that is becoming...
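The 20% rule of thumb in this excerpt can be sketched in a few lines of Python. This is only an illustration: the function name `is_large_dataset` and the `threshold` parameter are hypothetical and do not come from the article, and total RAM stands in for "available RAM" for simplicity.

```python
GB = 1024 ** 3  # bytes per gigabyte (binary convention, an assumption)


def is_large_dataset(dataset_bytes, ram_bytes, threshold=0.20):
    """Return True when the dataset exceeds `threshold` of the given RAM.

    Hypothetical helper illustrating the excerpt's rule of thumb.
    """
    return dataset_bytes > threshold * ram_bytes


ram = 8 * GB                 # the 8 GB machine from the excerpt
cutoff_gb = 0.20 * ram / GB  # exact cutoff in GB under the 20% rule

print(f"cutoff: {cutoff_gb:.1f} GB")
print(is_large_dataset(2 * GB, ram))  # a 2 GB dataset is over the cutoff
print(is_large_dataset(1 * GB, ram))  # a 1 GB dataset is under it
```

Taken literally, 20% of 8 GB is 1.6 GB, so the excerpt's 2 GB figure reads as a rounded value.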
Saul Diez-Guerra at ODSC Boston 2015
Conferences, Python, Tools & Languages; ODSC East 2015|Python|Speaker Talk; posted by Open Data Science, August 18, 2015
What We Learned While Teaching Python and Data Science. Pedagogy and lessons learned from teaching online introductory Python and Data Science courses. This is how we approached the matter, what we learned, and where we want to go next. Presenter Bio: Saul Diez-Guerra works as...