Chen Zhang & Dmytro Dolgopolov | Entity Disambiguation With Knowledge Graph
KGC 2021 Conference, Workshops and Tutorials • 21m
During the presentation, we will share our experience in building a knowledge graph leveraging Spark, NLP, and machine learning. We will start by explaining the business problems and challenges, then walk through our data pipeline, including text analytics processes, name similarity solutions, street address normalization, clustering algorithms, and confidence level building. At the end, we will discuss the business impact and the takeaways.
Dmytro Dolgopolov and Chen Zhang are from FINRA, an organization that protects investors and handles massive amounts of data: it can run up to 50,000 compute nodes per day and process up to 135 billion market events per day. The talk covers how knowledge graphs are used at FINRA, mainly for enterprise search and higher-level analytics. To illustrate, the speakers walk through entity disambiguation with knowledge graphs: entity data is extracted first, then entities are linked to one another. The linked entities are grouped into clusters, and these clusters form disambiguation graphs that help identify unique entities. Dmytro then hands the mic to Chen, who gives insight into these steps and lists the obstacles they had to overcome to build such a system. #knowledgegraphs #knowledgegraphconference #knowledgegraphbigdataprocessing #knowledgegraphbusiness
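The link-then-cluster steps described above can be illustrated with a small sketch. This is not the speakers' pipeline (which runs on Spark); it is a minimal single-machine illustration under stated assumptions: the record fields `name` and `address`, the abbreviation table, the use of `difflib` for name similarity, and the 0.85 threshold are all hypothetical choices made for this example.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical abbreviation table; a real pipeline would need a far larger one.
ABBREVIATIONS = {"street": "st", "avenue": "ave", "suite": "ste", "road": "rd"}


def normalize_address(address):
    """Lowercase, drop commas and periods, and abbreviate common street words."""
    tokens = address.lower().replace(",", " ").replace(".", " ").split()
    return " ".join(ABBREVIATIONS.get(token, token) for token in tokens)


def name_similarity(a, b):
    """Character-level similarity in [0, 1] (an assumed stand-in metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def cluster_records(records, name_threshold=0.85):
    """Link records pairwise, then group linked records with union-find."""
    parent = list(range(len(records)))

    def find(i):
        # Follow parent pointers to the cluster root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(records)), 2):
        same_address = (
            normalize_address(records[i]["address"])
            == normalize_address(records[j]["address"])
        )
        similar_name = (
            name_similarity(records[i]["name"], records[j]["name"])
            >= name_threshold
        )
        if same_address and similar_name:
            parent[find(i)] = find(j)  # link the two clusters

    clusters = {}
    for i in range(len(records)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())
```

Each returned cluster is a list of record indices that the sketch treats as one unique entity. The all-pairs comparison is quadratic in the number of records, so at the data volumes mentioned in the talk some form of blocking or distributed processing would be needed; the sketch ignores that on purpose.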