Author: Darrin Bishop
Term frequencies are a way to “count” or represent a term in a document. Term frequencies are seen in all things Text, from Bag of Words and document term matrix to …
I am excited to have been chosen as a speaker for the Fall 2017 Chicago Suburb’s SharePoint / Cloud Saturday event! This year the event takes place on November …
Document Term Matrix: I like to think of the Document Term Matrix (DTM) as an implementation of the Bag of Words concept. A Document Term Matrix tracks the term frequency for …
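To make the idea concrete, here is a minimal sketch of term frequencies and a Document Term Matrix in plain Python. The function names `term_frequencies` and `document_term_matrix`, the whitespace tokenizer, and the two sample documents are assumptions made for illustration; they are not taken from the original posts.

```python
from collections import Counter


def term_frequencies(text):
    """Count how often each term occurs in one document (naive whitespace tokenizer)."""
    return Counter(text.lower().split())


def document_term_matrix(docs):
    """Build a DTM: one row per document, one column per vocabulary term."""
    counts = [term_frequencies(doc) for doc in docs]
    # The vocabulary is every distinct term seen in any document, sorted for stable columns.
    vocab = sorted(set().union(*counts))
    # A Counter returns 0 for missing terms, so absent terms become zero cells.
    matrix = [[doc_counts[term] for term in vocab] for doc_counts in counts]
    return vocab, matrix


# Hypothetical sample documents, for illustration only.
docs = ["the cat sat", "the cat saw the dog"]
vocab, dtm = document_term_matrix(docs)
print(vocab)
for row in dtm:
    print(row)
```

Each row is the Bag of Words for one document: word order is discarded and only the counts remain. A real pipeline would likely use a proper tokenizer and a sparse matrix rather than nested lists.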
Previously I wrote about how we should be treating our text data as Text and not trying to shoehorn this data into a rowset solution. Once we start treating …
For years IT visionaries have been telling us and showing us in charts how there is so much more text data available to us compared to structured, rowset data. …
You don’t have to be a data scientist or statistician to gain some benefits of the machine learning craze going on right now. Don’t get me wrong, you should …
Microsoft’s Data Insights Summit will be held in Seattle, WA on June 12-13, 2017. The event will be held at the Washington State Convention Center. The summit will cover …
Microsoft, working with edX.org, has created a data science track consisting of 15 different online courses. Ten courses must be completed with a passing grade in order to complete …
Microsoft has added two new (Beta) Advanced Analytics certification exams: Perform Cloud Data Science with Azure Machine Learning (70-774) and Perform Data Engineering on Microsoft Azure HDInsight (70-775). These …
Under all the pretty (or not so pretty) GUIs that we use as a development environment, there is a framework that does all the work. Integrated Development Environments (IDEs) …