Welcome to IIT
Technology of the month
Large Scale Hierarchical Text Classification (LSHTC) Pascal Challenge launched!
The LSHTC Challenge is a hierarchical text classification competition that uses large datasets based on the ODP Web directory data (www.dmoz.org). Hierarchies are becoming ever more popular for organizing text documents, particularly on the Web; Web directories are one example. Their widespread use brings a need to classify new documents automatically into the categories of the hierarchy. As the hierarchy grows and the number of documents to be classified increases, a number of interesting machine learning problems arise. In particular, this is one of the rare situations where data sparsity remains an issue despite the vast amount of available data. The reasons are the simultaneous increase in the number of classes and their hierarchical organization. The latter leads to a very high imbalance between the classes at different levels of the hierarchy. Additionally, the statistical dependence between the classes poses both challenges and opportunities for learning methods.


[2] Award of scholarships for doctoral dissertation research - 2011. Deadline 09/09/2011. [published 02/08/2011]
[3] Award of scholarships for doctoral dissertation research with the University of Loughborough - 2011. NEW application deadline 20/7/2011 [published 28/06/2011]
[4] Study abroad program at the Institute of Informatics & Telecommunications, completed on June 28 [published 28/06/2011]


