Text documents in the web are in hierarchy, increase in the content, information grows over the years. To classify those text documents, need a class labels. But documents in the corpus belong to more than one class or category. Most of the corpus is large in size example. Wikipedia, Yahoo ODP directory. To classify those large-Scale dataset need a multi-label to categorize those datasets. More number of document added to the hierarchy, it create very high imbalance between classes at the different levels of hierarchy. Difficult to assign the documents to the actual class, so that relevance measure is used to calculate, relevance of text document to the class label, to maintain stable hierarchy. Another issue is if number of unique label is increase, it create instability in a classification, and also slow the classification process, so that try to limit the unique label in the classification, it improves the classification performance.
Prasath, M.Balaji and Manjula, D.
"Issues in Large-Scale Hierarchical Classifications,"
International Journal of Computer Science and Informatics: Vol. 1
, Article 15.
Available at: https://www.interscience.in/ijcsi/vol1/iss3/15