How Open Source Developers Can Push the Universe’s Frontier
Editor’s note: Dr.-Ing. Thomas Albin is a speaker for ODSC Europe this June 14th-15th. Be sure to check out his talk, “Space Science with Python – Enabling Citizen Scientists,” there! 2009, a paper by Postberg et al. was published in Nature. The title: Sodium Salts in... Read more
Top Low-Code and No-Code Platforms for Data Science in 2023
With all the talk about new AI-powered tools and programs feeding the imagination of the internet, we often forget that data scientists don’t always have to do everything 100% themselves. Often, some other tools and platforms cut out the middleman and allow you to create some... Read more
AgentChain: Chain Together Models to Perform Complex Tasks
AgentChain uses Large Language Models (LLMs) for planning and orchestrating multiple Agents or Large Models (LMs) for accomplishing sophisticated tasks. AgentChain is fully multimodal: it accepts text, image, audio, and tabular data as input and output. 🧠 LLMs as the brain: AgentChain leverages state-of-the-art Large Language Models to... Read more
Why the Best Time to Learn Python is Right Now
Python has been gaining popularity in the programming world for years thanks to its easy-to-learn syntax, and now it stands as one of the most widely used programming languages. Because of this, it has gained a reputation as the language of choice for data science and... Read more
Python Constants – Everything You Need to Know
Python Constants contribute to the dynamic and updatable characteristic of the design architecture, which is essential for any coding architecture. Providing a construction with these features is related to the fact that the code blocks can be understood and developed by someone else. To meet those... Read more
5 Easy SQL Tricks to Clean Dirty Data
Real-world data is almost always messy. And as a data scientist or analyst, you need to discover the facts about the data. To do so, the data must be tidy and free from errors. Hence, the very first step is to clean the data. Therefore, I... Read more
An Introduction to Orchestrating Data Assets with Dagster
Editor’s note: Sandy Ryza is a speaker for ODSC West this November 1st-3rd. Be sure to check out his talk, “Orchestrating Data Assets instead of Tasks, with Dagster,” there! Dagster is an open-source data orchestrator: a framework for building and running data pipelines, similar to how... Read more
From Pandas to Features to Models to Predictions – A Deep Dive Into the Hopsworks APIs
When it comes to feature stores, there are two main approaches to feature engineering. One approach is to build a domain-specific language (DSL) that covers all the possible feature engineering steps (e.g., aggregations, dimensionality reduction, and transformations) that a data scientist might need. The second approach... Read more
5 Preferred Programming Languages for Web Scraping
Web scraping or web harvesting requires a good tool to be undertaken efficiently. It involves data crawling, content fetching, searching, parsing, as well as data reformatting to make the collected data ready for analysis and presentation. It is important to use the right software and languages... Read more
3 Ways to Protect Your Code from Software Supply Chain Attacks
Supply chain attacks are intended to benefit from the trust that has grown between a business and a select number of outside partners. Considering that businesses use a wide variety of third-party software for communication, meetings, and the deployment of websites, among other things, it is... Read more