Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
…
3 pages
1 file
Current survey done on today's scenario shows, result gadget declared by Universities(eg. Pune Uni.) for engineering is in PDF file format. The PDF data contents detail such as seat no, centre, permanent registration no.(PRN), Name, Subjects, Marks, etc. Presently PDF file is extracted in excel file format, this conversion is done in order to extract various reporting formats required by department/college/university at various level. Thus, it involves somewhat manual process. However, all these operation have certain limitations such as semi-automated process, no GUI present, SMS gateway is not support, E-mail gateway is not supported, and mainly graphical analysis of data is not available. On the basis of survey done, we came across existing applications which are semi-automated or automated with some restrictions which does not allow full automation of result analysis in proper format. Thus none of the applications supported the full automation. To overcome above said drawbac...
Current survey done on today’s scenario shows, result gadget declared by Universities(eg. Pune Uni.) for engineering is in PDF file format. The PDF data contents detail such as seat no, centre, permanent registration no.(PRN), Name, Subjects, Marks, etc. Presently PDF file is extracted in excel file format, this conversion is done in order to extract various reporting formats required by department/college/university at various level. Thus, it involves somewhat manual process. However, all these operation have certain limitations such as semi-automated process, no GUI present, SMS gateway is not support, E-mail gateway is not supported, and mainly graphical analysis of data is not available. On the basis of survey done, we came across existing applications which are semi-automated or automated with some restrictions which does not allow full automation of result analysis in proper format. Thus none of the applications supported the full automation. To overcome above said drawbacks, we proposed a new system for result analysis, which is automated with features like Auto-output generation in different database format like excel, PDF, Mysql for further compatibility with other ERP system as per user selection, active SMS gateway, active Email gateway, interactive and user friendly GUI, graphical result analysis with text. In Proposed system we have targeted the limitations to provide effective solution for result analysis. This system will also work on current grade system. Where we are going to maintain database of students which will show whole status of students. Automated solutions provided by the system will make exam department activities more efficient by covering most of the important drawbacks of manual system, namely speed, precision and simplicity. It will also work as a generalized system to support any type and format of PDF file. A centralized system will ensure that the activities in the context of an examination can be managed effectively, while also making it more accessible and convenient for both staff and students.
International Journal of Recent Technology and Engineering (IJRTE), 2019
Automation is the future for organizational processes. Robotic Process Automation (RPA) is the solution for software automation in various domains like IT, Finance and accounting, Supply chain and so on. In this paper we propose a RPA solution for education domain. This paper shows the automation process for result analysis of student’s examination results. The automation process takes input as the university result in pdf form. We performed automation on this input file using Automation anywhere tool. Our result shows that all the work is error free. Also time required for this analysis is around 94.44% less as compared to manual analysis by human.
2021
The massive production of documents in portable document format (PDF) format has motivated research on automated extraction of data contained in these files. This work is mainly focused on extractions of natively digital PDF documents, made available in large repositories of educational exams. For this, the educational tests applied at Enade were used and collected automatically using scripts developed with Scrapy. The files used for the evaluation comprise 343 tests, with 11.196 objective and discursive questions, 396 answers, with 14.475 alternatives extracted from the objective questions. For the construction of ground truth in the tests, the Aletheia tool was used. For the extractions, existing tools were used that perform data extractions in PDF files: tabular data extractions, with Excalibur and Tabula for answer extractions, textual content extractions, with CyberPDF and PDFMiner to extract the questions, and extractions of regions of interest, with Aletheia and ExamClipper f...
In today's era of computerized banking, management, billings and what not, we use tabular data in every sector. The most commonly used format of storing tabular data by us is through excel format. It is very easy to retrieve information from excel sheets. But, tabular data extraction from PDFs or images has remained as an inherent problem since many years. In order to reduce such issues and automate the process we have designed a system using artificial intelligence that can take a PDF or an image as an input and outputs a CSV or excel file directly with the extracted tabular data.
International Journal of New Computer Architectures and Their Applications, 2011
IRJET, 2022
Information Extraction from PDFs for analysis is a common sight in the corporate world. The manual work done by the analysts consumes time depending on the size of the annual reports they are referring to. It also hinders the scalability of the process. Therefore, automation of data analysis for the analysis of PDFs is a necessity today. Hence this paper provides an algorithm by which information can be extracted from the PDFs and mapped to various categories of interest. The categories of interest can be varied, depending on the requirements by the user. The text extraction can be done using simple modules like PDFMiner. However, the dictionary creation has to be done for the sentences to be mapped to particular topics. Using rule- based filters will help extract the required sentences without much consumption of memory and can be understood very easily compared to complex procedures in the algorithm. The proposed algorithm simplifies the entire process of information extraction by providing a broad framework inside the algorithm that can be further modified based on the interests of the user
Technologies, 2019
In the age of digitalization, the collection and analysis of large amounts of data is becoming increasingly important for enterprises to improve their businesses and processes, such as the introduction of new services or the realization of resource-efficient production. Enterprises concentrate strongly on the integration, analysis and processing of their data. Unfortunately, the majority of data analysis focuses on structured and semi-structured data, although unstructured data such as text documents or images account for the largest share of all available enterprise data. One reason for this is that most of this data is not machine-readable and requires dedicated analysis methods, such as natural language processing for analyzing textual documents or object recognition for recognizing objects in images. Especially in the latter case, the analysis methods depend strongly on the application. However, there are also data formats, such as PDF documents, which are not machine-readable a...
Information extraction (IE) aims at extracting specific information from huge amount of documents. Now a day's internet became a great source of information and contains immeasurable amount of data which makes it tedious for normal users to retrieve relevant data, therefore it is a demand of present time to have a efficient information extraction system that convert web pages and their data into user friendly structures for this purpose many extraction system has been developed with variable performance this paper will going to throw light on such one IE system. This research paper introduces a method that uses rule based technique to induce an extraction. This research paper enables the user to gather more relevant piece of information and helps to improve the search keyword to extract efficient desirable knowledge for end user
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.
Journal of Advanced Management Science, 2016
2013
Iicai, 2009
2019
Communications in Computer and Information Science, 2018
International Journal of Applied Engineering Research
International journal of Advances in Engineering and Management (IJAEM), 2021
International Journal for Research in Applied Science & Engineering Technology (IJRASET), 2023
isara solutions, 2020
CERN European Organization for Nuclear Research - Zenodo, 2022