![]() This puts tremendous pressures on the Indian government to not only have comprehensive language policies, but also to create resources for their maintenance and development. ![]() India is considered a linguistic ocean with 4 language families and 22 scheduled national languages, and 100 un-scheduled languages reported by the 2001 census. This paper addresses all these problems and presents a transliteration based search engine (inSearch) which is capable of searching 10 multi-script and multiencoded Indian languages content on the web. Most of the users hardly know how to type in their native language and prefer to access the information through English based transliteration. (3) Finally the query input mechanism is another major problem. (2) Inspite of Indian languages sharing common phonetic nature, common words like loan words (borrowed from other languages like Sanskrit, Urdu or English), transliterated terms, pronouns etc., can not be searched across languages. ![]() In the Indian scenario, majority web pages are not searchable or the intended information is not efficiently retrieved by the search engines due to the following: (1) Multiple text-encodings are used while authoring websites. ![]() Most of the Internet data for Indian languages exist in various encodings, causing difficulties in searching for the information through search engines.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |