Graph Algorithms in SQL
Similarity Algorithm (Jaccard)
The Similarity algorithm is used to measure similarities between different sets of nodes or entities in a network. This is done by analyzing the data in each set and giving each set a score. The sets are then compared to each other and given a final score ranging between 0 and 1, where 0 is no match at all, 0.5 is a slight match, and 1 is a perfect match. Timbr’s GA library includes the popular Jaccard Similarity algorithm.
Similar entities identified for each set
Sample use cases
Recommendations
Recommending products or services based on similar customers purchase history
Marketing
Detecting similar audiences for targeted advertising
Health Care
Predicting patients’ health status based on health data and patient similarity
Text Mining
Identifying similarities between multiple text documents, web pages, etc.
Social Media
Suggesting new relationships based on interests, behavior, etc.