This is a crude abstract, Basically I wish to know if someone can mentor this project and its feasability with taxanomy ..
ABSTRACT
Online content has become huge and is scattered all over the internet, data present is not classified properly and is not linked properly and hence not very easy to find. Current search engines search on the basis of keywords and only very few keep context in mind, Semantic web steps in to solve the problem by classifying data according to a set of rules called ontologies and establish relation between data. Ontologies are explicit in nature and some exist for the field of research, medical, Journals etc but this is unusable for a vast entity like world wide web as its not humanly possible to create ontology of everything manually.This makes semantic web difficult to come in mainstream web, It becomes important to have a engine which can classify incoming data with the help of seed ontology and its own metadata.
Folksonomy is a new phenomena in web 2.0 where people have started labelling their content with metadata usually called tags. This brings in human element and thus give some chance for contextual data to come in picture. My idea is to read these tags of the data and build a ontology/taxanomy on the basis of seed ontology/taxanomy where classification may not be perfect but will become automated and hence usable.
Drupal is an open source CMS which supports concept of taxanomy and vocabulary in its framework which I plan to leverage and extend it to build a classifier. This engine take data from various forums, community bulletins and portals like youtube, flickr etc and try to classify the data into existing vocabulary and establish relation between the data. Using the same technique I can also execute tag cleaning which simply remove data which doesn't fit in the class.
Keywords: Semantic Web, Tag Cleaning, Folksonomy, Ontology, Classification, Taxanomy.

Comments
I will write application
I will start writing my application once I get some initial feedback .. I am pretty late at waking up to SoC I realize :|
Thanks
Dipen Chaudhary
http://dipenchaudhary.com
http://playdrupal.com (under construction)
Dipen Chaudhary
Founder, QED42 http://www.qed42.com Drupal development
For your Reference
I got this from tom gruber so I am including it here ..
http://sioc-project.org/drupal
He doesn't have time to mentor/guide this project and I really could use some help ..
Dipen Chaudhary
http://dipenchaudhary.com
http://playdrupal.com (under construction)
Dipen Chaudhary
Founder, QED42 http://www.qed42.com Drupal development
Reply from reinhard
Hi Dipen,
some more thoughts:
- your idea is a pretty demanding one, that easily could fill e.g. a master
thesis. You have to be realistic while planning
- as much as possible of the classifier code should not be drupal specific,
maybe you can even realize it as a kind of small web service and then
integrate it with a drupal module (I guess, it makes most sense to realize it
in php still, as that is available at most providers)
- you should add the possibility of having a "moderated" ontology generation:
the software suggests terms and relationships but a moderator can easily
accept/reject the suggestions (one such ontology generator that I know, which
generates ontologies from heaps of web pages gets about half of its
suggestions accepted by a morderator/"ontology expert")
- if I am right, you need lots (and lots (and lots)) of tags to make up such
an ontology. having the possibility to use additional tags could make sense
Have you already written a proposal for SOC? It's okay if you add my address
and keep me informed, of course!
Cheers,
Reinhard
Dipen Chaudhary
http://dipenchaudhary.com
http://playdrupal.com (under construction)
Dipen Chaudhary
Founder, QED42 http://www.qed42.com Drupal development
Taxonomy directed Folksonomy
Hi Dipen
I started working on the Taxonomy directed Folksonomy( Drupal).I am doing the ground work for that.I think we are working on the same thing.I need some assistance from you regarding the modules to be worked on.
Regards
Bhanu Prakash Gogineni