Chen Zhang & Dmytro Dolgopolov | Entity Disambiguation With Knowledge Graph
KGC | The Complete Collection
•
21m
During the presentation, we will share our experience in building a knowledge graph leveraging Spark, NLP, and Machine Learning. We will start with explaining the business problems and challenges. Then walk through our data pipeline, including text analytics processes, name similarity solutions, street address normalization, clustering algorithms, confidence level building, etc. At the end we will discuss the business impact and the takeaways.
Dmytro Dolgopolov and Chen Zhang from FINRA, a company that protects investors controlling massive amounts of data as they can run up to 50,000 compute nodes per day and they process up to 135 billion market events per day. This talk will mainly talk about the knowledge graphs that they use are FINRA which are mainly enterprise search and using higher level analytics. In order to explain it, speakers talk about the Entity Disambiguation with Knowledge Graphs where data is extracted from an entity, then they link entity to entity. After linking, clusters are built of these entities and then these clusters allow disambiguation graphs to form which help identify unique entities. Dmytro will give Chen the mic and she provides an insight to these steps and list obstacles they had to overcome to create a system like this. #knowledgegraphs #knowledgegraphconference #knowledgegraphbigdataprocessing #knowledgegraphbusiness
Up Next in KGC | The Complete Collection
-
Chris Welty | Shopping Sense: Bringin...
Knowledge Graphs (KGs) continue to penetrate the industrial world after Google's famous "things not strings" was used to explain their acquisition of FreeBase ten years ago. While many KGs exist, they are by and large little more than "entity catalogs", missing entirely the links between those e...
-
Dan McCreary | Graph Hardware Is Coming!
In this presentation we will show how current general-purpose CPU hardware fails to deliver high performance graph analytics. We show that by doing a detailed analysis of the actual hardware functionally needed by graph queries (pointer jumping), we can redesign hardware that is optimized for fas...
-
Freddy Lecue | On The Role Of Knowled...
Machine Learning (ML), as one of the key drivers of Artificial Intelligence, has demonstrated disruptive results in numerous industries. However one of the most fundamental problems of applying ML, and particularly Artificial Neural Network models, in critical systems is its inability to provide ...