Discussion:
[DBpedia-discussion] Unsupervised Learning of DBpedia Taxonomy: DBTax
Shashank Motepalli
2017-05-18 06:15:21 UTC
Permalink
Hi everyone,

I am Shashank Motepalli. I would be working on Unsupervised learning of
DBpedia Taxonomy Project this summer as part of Google Summer of Code
program.(Project Link
<https://summerofcode.withgoogle.com/projects/#5713033692184576>)

Mentors: Marco Fossati and Dimitris Kontokostas
Here is the link to my Proposal( Link
<https://docs.google.com/document/d/1CuVIPK5zMr4ykhoTaM9QUPJidDTZvzckD2kurpHK5qI/edit?usp=sharing>)
and my progress can be tracked at progress page( Page Link
<https://github.com/dbpedia/DBTax/wiki/Shashank-Motepalli:-GSoC-2017>).

About the project:
DBpedia tries to extract structured information from Wikipedia and make
information available on the Web. In this way, the DBpedia project develops
a gigantic source of knowledge. However, the current system for building
DBpedia Ontology relies on Infobox extraction. Infoboxes, being human
curated, limit the coverage of DBpedia. This occurs either due to lack of
Infoboxes in some pages or over-specific or very general taxonomies. These
factors have motivated the need for DBTax.

DBTax follows an unsupervised approach to learning taxonomy from the
Wikipedia category system. It applies several inter-disciplinary NLP
techniques to assign types to DBpedia entities. The primary goal of the
project is to streamline and improve the approach which was proposed. As a
result, making it easy to run on a new DBpedia release. In addition to
this, also to work on learning taxonomy of DBTax to other Wikipedia
languages.
Thanks.
Regards,
Shashank Motepalli

Loading...