fbpx
ODSC East Warmup Guide to spaCy
spaCy is an open-source library used for advanced Natural Language Processing (NLP) in Python. spaCy is primarily used in the production environment for building applications that can process and understand large volumes of text. It is a fast library with an API that accesses the methods... Read more
ODSC Europe 2021 Top Picks: 11 of Our Favorite Sessions to Watch for Free
As we ramp up preparations for ODSC Europe 2022 coming to London and Virtually this June 15th-16th, we want to look back on ODSC Europe 2021. These sessions exemplified important topics in data science, such as responsible AI, machine learning safety, and trending tools for modeling.... Read more
The ODSC Warmup Guide to Keras
Keras is a Python library for deep learning. Deep learning is a sub-branch of artificial intelligence that focuses on solving complex computations by emulating the working process of a human brain. Neural networks, computational graphs composed of nodes representing multiple operators for breaking down the tasks... Read more
Warmup Guide to Apache Kafka
Apache Kafka is an open-source distributed event streaming platform used to collect, store, and integrate data at scale. Several companies use it for high-performance data pipelines, streaming analytics, data integration, and mission-critical applications. Apache Kafka has been instrumental across many industries for event streaming purposes such... Read more
The ODSC Warmup Guide to fastText
fastText is a lightweight Python library that was created to build scalable solutions for text representation and classification. It works on standard and generic hardware, smartphones, and small computers by utilizing functionalities that can reduce the memory consumed for fastText models. In recent years, fastText has... Read more
The ODSC Warmup Guide to PyTorch
PyTorch is an open-source framework built for developing machine learning and deep learning models. In particular, this framework provides the stability and support required for building computational models in the development phase and deploying them in the production phase.  PyTorch functionalities are extensible with other Python... Read more
The ODSC Warmup Guide to Apache Airflow
Apache Airflow is a workflow automation platform that schedules and monitors workflows in the data pipelines programmatically. Airflow makes it simpler to set up and operate an end-to-end data pipeline in the cloud. You can use Airflow to manage and create workflows without worrying about the... Read more
The ODSC Guide to Data Science Careers
In this free-to-download guide, you’ll learn everything you need to know to start your data science career in 2021, including how to find a job in data science, core skills to stand out, resume tips, and so on. Here’s a preview of what’s inside: How to Prepare... Read more
The Warmup Guide to Hugging Face
Since it was founded, the startup, Hugging Face, has created several open-source libraries for NLP-based tokenizers and transformers. One of their libraries, the Hugging Face transformers package, is an immensely popular Python library providing over 32 pre-trained models that are extraordinarily useful for a variety of... Read more
The Warmup Guide to OpenAI Gym
Founded in 2015 by Elon Musk, Sam Altman, and several others, OpenAI is a non-profit company dedicated to building friendly AI that is beneficial for everyone. One of its most well-known products is the OpenAI Gym. Gym is a toolkit for developing and comparing reinforcement learning... Read more