fbpx
GraphReduce: Using Graphs for Feature Engineering Abstractions
Editor’s note: Wes Madrigal is a speaker for ODSC West this October 30th to November 2nd. Be sure to check out his talk, “Using Graphs for Large Feature Engineering Pipelines,” there to learn more about GraphReduce and more! For readers who work in ML/AI, it’s well... Read more
Data Morph: Moving Beyond the Datasaurus Dozen
Editor’s note: Stefanie Molin is a speaker for ODSC West 2023 this Fall. Be sure to check out her talk, “Data Morph: A Cautionary Tale of Summary Statistics,” there! This article introduces Data Morph, a new open-source Python package that can be used to morph an input... Read more
Visualization for Clustering Methods
Editor’s note: Evie Fowler is a speaker for ODSC West. Be sure to check out her talk, “Bridging the Interpretability Gap in Customer Segmentation,” there! At this Fall’s Open Data Science Conference, I will talk about how to bring a systematic approach to the interpretation of... Read more
Graph Viz: Exploring, Analyzing, and Visualizing Graphs and Networks with Gephi and ChatGPT
Editor’s note: Tamilla is a speaker for ODSC East this May. Be sure to check out her talk, “Graph Viz: Exploring, Analyzing and Visualizing Graphs and Networks with Gephi and ChatGPT,” there! Analyzing the collaboration between authors of scientific papers can provide valuable insights into research... Read more
How to Deploy a Deep Learning Model with Jina (and Design a Kitten Along the Way)
The following is an article by Alaeddine Abdessalem and Alex C-G of Jina AI. Since the start of Jina, we’ve supported building Flows – namely pipelines of microservices. Now we’re breaking those microservices out of the Flow and letting you deploy them stand-alone in the form of an Executor and Deployment.... Read more
Multimodal Visual Question Answering with BLIP-2 and Jina
The following is an article by Alaeddine Abdessalem and Alex C-G of Jina AI. Large Language Models (LLMs) have recently gained a lot of traction, with many popular models showing up like GPT, OPT, BLOOM and more. These models excel at learning natural language, making them... Read more
11 Open Source Data Exploration Tools You Need to Know in 2023
There are many well-known libraries and platforms for data analysis such as Pandas and Tableau, in addition to analytical databases like ClickHouse, MariaDB, Apache Druid, Apache Pinot, Google BigQuery, Amazon RedShift, etc. While machine learning frameworks and platforms like PyTorch, TensorFlow, and scikit-learn can perform data... Read more
3 Tips for Using Python Libraries to Create 3D Animation
Data visualization is becoming increasingly popular, meaning open-source tools like Python are more widely used by data scientists and other computer science professionals. A specific type called 3D animation makes information accessible, visually attractive, and appealing to users. Powerful Python libraries enable you to create unique... Read more
Is Data Processing Capacity Important When Choosing Data Visualization Tools for Your Business?
Data processing capacity can have a major impact on the performance of any data visualization tool. Finding the right data visualization tool is about balancing your business’s on-premises computing resources and your data processing needs. What determines data processing capacity and how can businesses balance computing... Read more
How to Choose the Right Data Visualization Tools for Your Business
Depending on your business needs, you’ll prioritize specific data visualization tools over others. Most businesses favor standard pie charts and bar graphs, but others may select waterfall charts and heat maps.  Depending on your software, you’ll find that some data visualization tools create more stunning visuals... Read more