Natural Language Processing: What are the best algorithms, papers on entity extraction, relationship extraction from text?
-
Best = Current state-of-the-art methods. Entity = Not just named entities like people, location and organizations. More generally, anything that could be a Wikipedia article title, or WordNet synsets, etc. Essentially best performing methods for going from Wikipedia to Freebase automatically. EDIT: I guess the description was misleading. What I am looking for is algorithms that would automatically create structured databases such as Freebase, WordNet, or ConceptNet, etc. i.e, by relationship extraction I am referring to learning the relationship (horse, is-a, animal) by going through some text corpus.
-
Answer:
I think what you are after is automatic construction of semantic knowledge bases. I have co-authored a survey published this year, which reviews many different methods and their applications: http://onlinelibrary.wiley.com/doi/10.1002/widm.1097/abstract Here is the PDF of the submitted version: http://www.medelyan.com/files/WIDM1097.pdf
Alyona Medelyan at Quora Visit the source
Other answers
Just like Yang Li said, the question asked is a typically entity linking problem, which tries to link the extracted mentions to the entities in the Wikipedia page. I am in a team which participate in this year's Entity Linking Task. Here are the papers you definitely should have looked at. 1. Local and Global Algorithms for Disambiguation to Wikipedia. UIUC Group In this paper, the authors try to form the problem as an optimization problem and provides a list of local and global features, which might be categorized as algorithm part. They also published their system, called http://cogcomp.cs.illinois.edu/page/software_view/Wikifier. You can use their system to do entity disambiguation. When it comes to entity extraction part, you should paper their system paper, called "GLOW TAC-KBP 2011 Entity Linking System". You can get to know how they extract the mentions from the query document. 2. A Neighborhood Relevance Model for Entity Linking, UMass Group As we know, context information is very important for entity linking. If you just focus on the query document, you just get little context for the extracted mentions. This way, the second paper tries to collect the global context for mentions across the entire corpus. 3. Ji Heng's several papers about their system for entity linking. You can go to her website(http://nlp.cs.rpi.edu/publication.html) and search entity linking. They also has existed software. You can get your hand dirty into this field by using their system. 4. HLTCOE Participation at TAC 2012: Entity Linking and Cold Start Knowledge Base Construction, JHU Group Structured prediction is a very hot topic in the field of machine learning. In this paper, they proposed that Structured Prediction Cascades algorithm can be used to deal with entity linking problem. However, according to their paper, they have not implemented it in their system. It is just an idea. So if you want to focus on the algorithm, this idea is deserved for a try. 5. Joint Coreference Resolution and Named-Entity Linking with Multi-pass Sieves, Washington Group Coreference resolution is a very close are with the entity linking. So in this paper, they try to combine the two questions together and the hypothesis is that both task can benefited from each other. There are also a lot of research going on in this field. Just stay tuned.
Jun Xie
I believe you are looking for something a.k.a entity linking? There are a lot of publications recently on this topic. Just search "entity linking" or "entity disambiguation" on google. For off-the-shelf tools, you can refer to Wikifier from UIUC or AIDA from MPI.
Yang Li
Tutorials and papers: http://ejmeij.github.io/entity-linking-and-retrieval-tutorial/ or http://nlp.cs.rpi.edu/kbp/2015/elreading.html Existing systems: http://babelfy.org/index.jsp AIDA: https://gate.d5.mpi-inf.mpg.de/webaida/ Tagme (for queries): http://tagme.di.unipi.it/ Wikifier: http://cogcomp.cs.illinois.edu/page/demo_view/Wikifier
Octavian Ganea
Related Q & A:
- What is the best language to start with?Best solution by Stack Overflow
- What is the best language for HTML parsing and web scraping?Best solution by Quora
- What are different types of Entities in Entity Framework?Best solution by entityframeworktutorial.net
- What's the best nautral foods to eat for natural shiny hair?Best solution by Yahoo! Answers
- What is the best way to improve your language skills?Best solution by Quora
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.