fbpx
How to Find Duplicates (and Near-Duplicates) in a Corpus with NLP
Building a large high-quality corpus for Natural Language Processing (NLP) is not for the faint of heart. Text data can be large, cumbersome, and unwieldy and unlike clean numbers or categorical data in rows and columns, discerning differences between documents can be challenging. In organizations where documents are... Read more
The Three Trending Data Science Jobs and How to Land Them
There has been a lot of buzz about data scientist jobs recently. And for good reason! Since 2016, data scientist has been at or near the top of Glassdoor’s Best Jobs in America list. But since the job hit the top spot in the list, the field has... Read more
Three Advanced Metrics to Optimize Your Fantasy Football Lineup
You’re down by 10 points in your NFL fantasy football league, and you need to choose a wide receiver from the free agency pool because your starter was injured. How do you decide to get the 11 points required for a win? What methods will you... Read more
3 Ways to Enhance Productivity with AI
Pre-processing and exploring data, building and deploying models and turning those scoring values into an actionable insight can be overwhelming. A recent survey shows that for data scientists, the many tasks they spend their time working on are very different from the tasks they actually want to prioritize. This... Read more
How Data Scientists Used NLP to Save Indigenous African Languages
Data scientists in Cameroon joined forces to form team LangTech as a part of the SAS Global Hackathon. In the face of rapid digitalization and modernization, they sought a way to preserve indigenous African languages. There are over 1,000 African languages, but those with fewer than 100,000 speakers... Read more
ModelOps, MLOps, and Finding Value in Analytics
In a recent survey, 42% of data scientists reported that their results were not used by decision-makers. This is a twofold issue. First, there are the organizations that have invested hundreds of billions of dollars into analytics worldwide annually. Organizations have invested in data collection, cleaning,... Read more
Data Science Talent Strategies for the Great Resignation
The COVID-19 pandemic brought huge changes in the workplace, such as a massive increase in remote working. These may or may not last. However, one pandemic-driven change shows no signs of stopping: the Great Resignation. When the pandemic first hit, resignations as a proportion of the... Read more
How to Become a Data Scientist Who Lives at the Beach
Everyone has a unique career path. In data science, often starts with curiosity about a subject that can only be untangled by applying analytics. In Robert Blanchard’s case, he was studying economics and wanted to learn more about consumer behaviors as they relate to buying patterns.... Read more
Exploring Natural Language Processing: Two Ways You Can Leverage Corpus Analysis
Corpus analysis is a technique widely used by data scientists because it provides understanding of a document collection and provides insights about the text.  It’s an apt methodology to consider as we came upon Charles Dickens’ 210th birthday earlier this year because of how frequently passages... Read more
Your Guide For Analyzing Real Time Data with Streaming Analytics from SAS® Viya® on Azure
As artificial intelligence comes of age and data continues to disrupt traditional industry boundaries, the need for real-time analytics is escalating as organizations fight to keep their competitive edge. The benefits of real-time analytics are significant. Manufacturers must inspect thousands of products per minute for defects.... Read more