Streamlining Government Regulatory Responses with Natural Language Processing, GenAI, and Text Analytics
Imagine if your job was to sort a massive pile of 40,000 stones into about 200 buckets based on their unique properties. Each stone needs to be carefully examined, categorized, and placed in the correct bucket, which takes about five minutes per stone. Fortunately, you’re not... Read more
Demystifying Machine Learning: Popular ML Libraries and Tools
As a senior data scientist, I often encounter aspiring data scientists eager to learn about machine learning (ML). It’s a fascinating field that can seem daunting at first, but I assure you, with the right mindset and resources, anyone can master it. In this comprehensive guide,... Read more
Origins of Generative AI and Natural Language Processing with ChatGPT
By now you’ve likely heard of ChatGPT, and the varying opinions surrounding it—people love it, people hate it, and people are afraid of it. It can generate a recipe for chocolate chip cookies, write a Broadway-style song about your kids, and create usable code. Joining in... Read more
Augmented Analytics – Where Do You Fit in at the Intersection of Analytics and Business Intelligence?
Data visualization is a critical way for anyone to turn endless rows of data into easy-to-understand results through dynamic and understandable visuals.  And with augmented analytics (and embedded insights), anyone can become a citizen data scientist, regardless of their advanced analytics expertise. A shift has been... Read more
Leveraging Time-Series Segmentation and Machine Learning for Better Forecasting Accuracy
Several papers discussed the importance of segmenting time series into groups and modeling each group separately to enhance forecasting accuracy overall. But what does this look like in practice? At the end of the day, why not use an AutoML package (Automated Machine Learning) or an... Read more
How Text Analytics and AI Can Help Investigators Combat Human Trafficking
Narrative data from police agencies on arrest or offense incidents, as well as tips to police departments, is both rich in information and also largely unavailable to the public for analysis. That said, recently came across ~45,000 unique narratives describing police incidents occurring in the city... Read more
What to Expect in 2023: A Data Scientist’s Top 5 AI Predictions
AI has come a long way in recent years, and it shows no signs of slowing down. In fact, many experts believe that we are on the cusp of some major breakthroughs in the field of artificial intelligence. With that in mind, here are my top... Read more
How to Find Duplicates (and Near-Duplicates) in a Corpus with NLP
Building a large high-quality corpus for Natural Language Processing (NLP) is not for the faint of heart. Text data can be large, cumbersome, and unwieldy and unlike clean numbers or categorical data in rows and columns, discerning differences between documents can be challenging. In organizations where documents are... Read more
The Three Trending Data Science Jobs and How to Land Them
There has been a lot of buzz about data scientist jobs recently. And for good reason! Since 2016, data scientist has been at or near the top of Glassdoor’s Best Jobs in America list. But since the job hit the top spot in the list, the field has... Read more
Three Advanced Metrics to Optimize Your Fantasy Football Lineup
You’re down by 10 points in your NFL fantasy football league, and you need to choose a wide receiver from the free agency pool because your starter was injured. How do you decide to get the 11 points required for a win? What methods will you... Read more