Papers by Pratibha Bajpai
The genetic algorithm (GA) is a search heuristic that is routinely used to generate useful soluti... more The genetic algorithm (GA) is a search heuristic that is routinely used to generate useful solutions to optimization and search problems. It generates solutions to optimization problems using techniques inspired by natural evolution, such as inheritance, mutation, selection, and crossover. Genetic algorithms are one of the best ways to solve a problem for which little is known. They are a very general algorithm and so work well in any search space. All you need to know is what you need the solution to be able to do well, and a genetic algorithm will be able to create a high quality solution.

Journal of Computer Science, 2018
The abundance of multilingual content on internet other than English gives an urge to develop inf... more The abundance of multilingual content on internet other than English gives an urge to develop information retrieval system that can cross language boundaries. Such cross lingual information retrieval systems will bridge this language gap and allow user to ask a query in regional language and retrieve relevant documents in a different language. The problem of finding relevant document in language different from source language is the most challenging application of any cross lingual information retrieval. This paper discusses the development process of complete English to Hindi cross language information retrieval system along with the contribution of individual components to the system. The main focus of this paper is to discuss how optimization is done to our disambiguation approach, which we named as 'Two level Disambiguation method'. The experimental results obtained affirm that the addition of a component 'Analyzer' to our CLIR architecture increases the efficiency of our proposed disambiguation algorithm.

International Journal of Electrical and Computer Engineering (IJECE), 2018
Selection of the most suitable translation among all translation candidates returned by bilingual... more Selection of the most suitable translation among all translation candidates returned by bilingual dictionary has always been quiet challenging task for any cross language query translation. Researchers have frequently tried to use word co-occurrence statistics to determine the most probable translation for user query. Algorithms using such statistics have certain shortcomings, which are focused in this paper. We propose a novel method for ambiguity resolution, named ‘two level disambiguation model’. At first level disambiguation, the model properly weighs the importance of translation alternatives of query terms obtained from the dictionary. The importance factor measures the probability of a translation candidate of being selected as the final translation of a query term. This removes the problem of taking binary decision for translation candidates. At second level disambiguation, the model targets the user query as a single concept and deduces the translation of all query terms si...

International Journal of Research in Engineering and Technology, 2014
Multilingual information is overflowing on internet these days. This increasing diversity of web ... more Multilingual information is overflowing on internet these days. This increasing diversity of web pages in almost every popular language in the world should enable the user to access information in any language of his choice. But sometimes it is difficult for a user to write her request in a language which she could easily read and understand. This makes cross-language information retrieval (CLIR) and multilingual information retrieval (MLIR) for Web applications a valuable need of the day. It increases the accessibility of web users to retrieve information in any language while post their queries in their native language. The paper critically analyzes the various researchers work in the area of Indian language CLIR. In this paper we also present our prospective prototype for English to Hindi language CLIR. It will also discuss the issues related to the English to Hindi language translation. We had tested 30 queries manually using suggested prototype and found that the precision level is quite good.

CompSciRN: Frameworks (Topic), 2016
Internet usage is increasing at a rapid rate. Anyonecan access all sorts of information from the ... more Internet usage is increasing at a rapid rate. Anyonecan access all sorts of information from the web at any time. But the language diversity of web pages restricts this access of information. Cross Lingual information Retrieval provides the solution to this problem, by allowing user to post their query in the native language and retrieve the documents in otherlanguages. Machine readable dictionaries are the mosteconomical resource of cross language translation. These dictionaries usually provide more than one translations in target language for source query terms. These translations need to be disambiguated to achieve the best translation for a query word.Once we achieve the correct translations, the cross lingual information system can match the monolingual informationretrieval on performance grade. In this paper we propose a framework for query translation and disambiguation to achieve an efficient cross lingual information system.
Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
Uploads
Papers by Pratibha Bajpai