Professional Data Scientist Network
Enables flexible, scalable personalization for increased customer engagement and lifetime value.
A recommender is a type of machine learning model that enables personalization. The best-known approach is collaborative filtering. Given user-item interaction data, recommendation systems can recommend new items to users based on their past interactions. A recommender system can also detect similar items or similar users: which items draw the same crowd, and which groups of users like the same items. Personalized recommenders are widely used on movie rental and music sharing sites, as well as by retail and news outlets. Good recommenders are proven to increase user interaction and drive revenue.
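As a rough illustration of the idea, the sketch below scores unseen items for a user by their similarity to items the user has already interacted with. It is a minimal item-based collaborative filter in plain Python, not the product's implementation; the interaction data and the function names (build_item_users, item_similarity, recommend) are hypothetical.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical interaction data: (user, item) pairs, e.g. "user rented movie".
interactions = [
    ("ann", "alien"), ("ann", "blade_runner"),
    ("bob", "alien"), ("bob", "blade_runner"), ("bob", "casablanca"),
    ("cat", "casablanca"), ("cat", "dune"),
]

def build_item_users(pairs):
    """Map each item to the set of users who interacted with it."""
    item_users = defaultdict(set)
    for user, item in pairs:
        item_users[item].add(user)
    return dict(item_users)

def item_similarity(item_users, a, b):
    """Cosine similarity between two items over binary user vectors."""
    shared = len(item_users[a] & item_users[b])
    if shared == 0:
        return 0.0
    return shared / sqrt(len(item_users[a]) * len(item_users[b]))

def recommend(pairs, user, k=2):
    """Rank unseen items by summed similarity to the user's past items."""
    item_users = build_item_users(pairs)
    seen = {item for u, item in pairs if u == user}
    scores = {}
    for candidate in item_users:
        if candidate in seen:
            continue
        scores[candidate] = sum(
            item_similarity(item_users, candidate, s) for s in seen
        )
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, score in ranked[:k] if score > 0]

print(recommend(interactions, "ann"))
```

The same item_similarity function answers the "which items draw the same crowd" question directly: two items are similar when largely the same users interacted with both.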
Classification is the problem of making a discrete prediction using training data. The key difference between regression and classification is that in regression the target is continuous, while in classification the target is categorical. A classifier can be used for several applications including:
Works with all your data
All models can incorporate the following set of rich feature types:
In addition to these feature types, models such as neural networks, boosted decision trees, and random forests can be used for feature extraction.
Clustering
Groups similar data points into clusters.
Clustering is the fundamental machine learning task of separating data into groups of similar points when the training dataset has no class labels. Clustering is often done in the exploratory data analysis phase to get a better intuition about the structure of a dataset, or as a preliminary step for more complicated models. There are countless clustering algorithms, but very few implementations scale well to the size of modern datasets.
Our clustering tools work with the highly efficient and optimized SFrame tabular data structure, which means they efficiently scale to very large datasets.
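To make the task concrete, here is a minimal sketch of k-means (Lloyd's algorithm), one common clustering method, in plain Python. It is only an illustration under simplifying assumptions: the points are hypothetical, the first k points seed the centers, and nothing here reflects how the SFrame-based tools are implemented.

```python
def kmeans(points, k, iterations=10):
    """Lloyd's algorithm on (x, y) tuples; the first k points seed the centers."""
    centers = list(points[:k])
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each point to its closest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            distances = [
                (p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centers
            ]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each center to the mean of its members.
        for i, members in enumerate(clusters):
            if members:
                centers[i] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    return centers, clusters

# Hypothetical unlabeled data with two visible groups.
points = [(1.0, 1.0), (9.0, 9.0), (1.0, 2.0), (2.0, 1.0), (9.0, 8.0), (8.0, 9.0)]
centers, clusters = kmeans(points, k=2)
print(clusters)
```

No labels are supplied anywhere: the grouping comes only from distances between points, which is what makes clustering useful for exploratory analysis.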
Provides users with state-of-the-art algorithms for classification. The resulting trained models can also transform input into features that are useful in the context of other machine learning tasks such as regression, clustering, or finding nearest neighbors.
Provides the most flexible way to search for data points that are similar to a set of queries, and does so at interactive speed.
With a GPU, training deep networks on millions of images becomes feasible and easy, with throughput of roughly 200 images/second on a single GPU. Multiple GPUs allow nearly linear scaling of throughput, so you can iterate and experiment with different architectures and parameters more quickly.
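A back-of-the-envelope estimate shows what these figures mean in practice. The helper below is a hypothetical sketch: it takes the ~200 images/second figure quoted above as its default, and its scaling_efficiency parameter (1.0 meaning perfectly linear scaling) is an assumption introduced here for illustration.

```python
def epoch_seconds(num_images, images_per_second=200.0, num_gpus=1,
                  scaling_efficiency=1.0):
    """Estimate the time for one pass over the data.

    Each GPU beyond the first is assumed to contribute
    scaling_efficiency * images_per_second of extra throughput.
    """
    throughput = images_per_second * (1 + (num_gpus - 1) * scaling_efficiency)
    return num_images / throughput

one_gpu = epoch_seconds(1_000_000)               # single GPU
four_gpus = epoch_seconds(1_000_000, num_gpus=4)  # ideal linear scaling
print(one_gpu / 60, four_gpus / 60)               # minutes per pass
```

Under these assumptions, one pass over a million images takes about 83 minutes on one GPU and about 21 minutes on four; a scaling_efficiency below 1.0 models the "nearly" in nearly linear.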
Finding the nearest neighbors of a set of query data points is a core component of many machine learning algorithms. The nearest neighbor classifier, for example, predicts the label for a query point based on the labels of the closest points in the training set. "Composite distance" functions allow comparisons between data points with any mix of data types, like numeric values and free text.
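The sketch below illustrates both ideas in plain Python: a composite distance that mixes a numeric feature with free text, and a nearest neighbor classifier built on top of it. The feature names, weights, and data are hypothetical, and the weighted-sum form is one assumed way to combine distances, not necessarily the toolkit's own.

```python
from collections import Counter

def jaccard_distance(text_a, text_b):
    """1 - |shared words| / |all words| over whitespace-separated tokens."""
    a, b = set(text_a.lower().split()), set(text_b.lower().split())
    if not a and not b:
        return 0.0
    return 1.0 - len(a & b) / len(a | b)

def composite_distance(x, y, price_weight=0.01, text_weight=1.0):
    """Weighted sum of a numeric distance and a text distance."""
    return (price_weight * abs(x["price"] - y["price"])
            + text_weight * jaccard_distance(x["description"], y["description"]))

def knn_predict(training, query, k=3):
    """Predict the majority label among the k closest training rows."""
    nearest = sorted(training, key=lambda row: composite_distance(row, query))[:k]
    votes = Counter(row["label"] for row in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical labeled rows mixing a numeric value with free text.
training = [
    {"price": 20.0, "description": "red cotton shirt", "label": "clothing"},
    {"price": 25.0, "description": "blue cotton shirt", "label": "clothing"},
    {"price": 30.0, "description": "wool winter coat", "label": "clothing"},
    {"price": 500.0, "description": "android smart phone", "label": "electronics"},
    {"price": 650.0, "description": "large smart tv", "label": "electronics"},
    {"price": 80.0, "description": "wireless phone charger", "label": "electronics"},
]

print(knn_predict(training, {"price": 22.0, "description": "green cotton shirt"}))
print(knn_predict(training, {"price": 550.0, "description": "smart phone"}))
```

The weights control how much each data type contributes; here the price weight is kept small so that a difference of a few dollars does not swamp the text comparison.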
Recommendation engines can increase customer engagement, which generally results in higher customer lifetime value. Recommendations can be based either on user preferences or on detailed categorization of the items or content.
Recommendation engines are often used for:
Recommendation engines are a commonly referenced application of big data analytics. We encounter these systems at retailers who show us products typically purchased together or at online services that curate books and music for us based on our preferences. Recommendation engines have significant proven ROI because they consistently boost sales as well as customer satisfaction.
React to early indicators of customer dissatisfaction to reduce loss of users and recurring revenue with customer churn models.
Predicting customer “churn” (when a customer will leave a provider of a product or service in favor of another) is a valuable application of machine learning. Predicting this move requires establishing correlations across a wide variety of data, including communication types and frequency, that might signal preferences and, ultimately, intent. Churn prediction is particularly important in the telecommunications industry, where a small number of mobile service providers must compete for a relatively finite customer base. The analysis has to be conducted frequently to ensure continued customer satisfaction through improved customer service and targeted offers. It also has to be applied to a large and varied body of data that grows rapidly, in lock step with the size of the subscriber’s social network.
Personalize your customer interactions and increase customer loyalty with customer segmentation machine learning models.
Data science, and by extension analytics, is transforming marketing into a highly targeted, contextual activity that aims to match the product to buyer needs and wants. By analyzing customer purchase histories and patterns of interaction with products and services, sellers can not only refine their offers to well-known market segments but also identify entirely new segments whose preferences were previously hidden in mounds of data.
Identify and prevent illegal financial activity faster with fraud detection machine learning models.
Machine learning holds a great deal of promise for fraud detection analytics (FDA). While it’s still early days for the discipline, it is well understood that analysis of financial transactions, email, customer relationships, and communications can help identify fraudulent activity, and even predict it before it occurs, saving financial services firms millions in lost revenue.
Understand customer preferences, feedback, and intent more accurately with sentiment analysis machine learning models.
Sentiment analysis essentially amounts to text and document classification, but instead of topic, the label is sentiment, such as a positive review or a secretive tone. Sentiment analysis is complex because looking for keywords is often not enough to infer the sentiment of a sentence or string of words. This makes sentiment analysis a big data problem that can benefit from machine learning models. Accurate sentiment identification has a myriad of applications, from making recommendations more accurate to sorting documents, books, and reviews for easy classification. Identifying a person’s intent can also help in fraud detection, churn prediction, and many other business intelligence applications.
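A small sketch makes the keyword problem concrete. The first function below counts positive and negative keywords and mislabels a simple negated sentence; the second adds a hand-written negation rule as a stopgap. The word lists and function names are hypothetical, and a trained classifier would learn such patterns from labeled examples rather than from rules like these.

```python
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "poor", "hate"}
NEGATORS = {"not", "never", "no"}

def label(score):
    """Turn a numeric score into a sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

def keyword_sentiment(text):
    """Naive baseline: count positive keywords minus negative keywords."""
    words = text.lower().split()
    return label(sum(w in POSITIVE for w in words)
                 - sum(w in NEGATIVE for w in words))

def negation_aware_sentiment(text):
    """Same counts, but flip a keyword's polarity when a negator precedes it."""
    words = text.lower().split()
    score = 0
    for i, w in enumerate(words):
        polarity = (w in POSITIVE) - (w in NEGATIVE)
        if i > 0 and words[i - 1] in NEGATORS:
            polarity = -polarity
        score += polarity
    return label(score)

review = "the battery is not good"
print(keyword_sentiment(review))         # keyword count alone gets this wrong
print(negation_aware_sentiment(review))
```

Hand-written rules of this kind break down quickly on sarcasm, longer-range negation, and domain-specific vocabulary, which is why the problem is usually treated as classification over labeled text.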
We provide high performance algorithms for: