Intelligent Indexing and Sorting Management System – Automated Search Indexing and Sorting of Various Topics

IF 0.6 4区工程技术 Q4 Engineering

Nuclear Engineering International Pub Date : 2020-12-20 DOI:10.18034/EI.V8I2.554

Apoorva Ganapathy, Takudzwa Fadziso

{"title":"Intelligent Indexing and Sorting Management System – Automated Search Indexing and Sorting of Various Topics","authors":"Apoorva Ganapathy, Takudzwa Fadziso","doi":"10.18034/EI.V8I2.554","DOIUrl":null,"url":null,"abstract":"An issue that the majority of the databases face is the static and manual character of indexing activities. This traditional method of indexing and sorting different topics is confirmed to shake the dataset performance somewhat, making downtime and a potential effect in the presentation that is normally addressed by manually indexing operations. Numerous data mining methods can accelerate this process by using proper indexing structures. Choosing the appropriate index generally relies upon the kind of operation that the algorithm performs against the dataset. Topic indexing is the operation of recognizing the principal topics covered by a document. These are helpful for some reasons: as subject headings in libraries, as keywords in scholarly articles, and as hashtags on social media platforms. Knowing a document’s topic assists individuals with deciding its importance quickly. In any case, assigning topics manually is a tedious and redundant task. This paper shows the best way to create them automatically in a way that contends with manual indexing done by humans. This paper also talks about the issues and the techniques for identifying applicable data in a huge variety of documents. The contribution of this thesis to this issue is to foster better content analysis techniques that can be utilized to describe document content with automated index terms. Index terms can be used as meta-data that defines documents and is utilized for seeking various topics. The main point of this paper is to show the way toward creating an automatic indexer which analyzes the topic of documents by integrating proof from word frequencies and proof from the linguistic analysis given by a syntactic parser. The indexer weighs the expressions of a document as per their assessed significance for depicting the topic of a given document based on the content analysis.","PeriodicalId":49736,"journal":{"name":"Nuclear Engineering International","volume":"5 1","pages":""},"PeriodicalIF":0.6000,"publicationDate":"2020-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nuclear Engineering International","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.18034/EI.V8I2.554","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}

引用次数: 2

Abstract

An issue that the majority of the databases face is the static and manual character of indexing activities. This traditional method of indexing and sorting different topics is confirmed to shake the dataset performance somewhat, making downtime and a potential effect in the presentation that is normally addressed by manually indexing operations. Numerous data mining methods can accelerate this process by using proper indexing structures. Choosing the appropriate index generally relies upon the kind of operation that the algorithm performs against the dataset. Topic indexing is the operation of recognizing the principal topics covered by a document. These are helpful for some reasons: as subject headings in libraries, as keywords in scholarly articles, and as hashtags on social media platforms. Knowing a document’s topic assists individuals with deciding its importance quickly. In any case, assigning topics manually is a tedious and redundant task. This paper shows the best way to create them automatically in a way that contends with manual indexing done by humans. This paper also talks about the issues and the techniques for identifying applicable data in a huge variety of documents. The contribution of this thesis to this issue is to foster better content analysis techniques that can be utilized to describe document content with automated index terms. Index terms can be used as meta-data that defines documents and is utilized for seeking various topics. The main point of this paper is to show the way toward creating an automatic indexer which analyzes the topic of documents by integrating proof from word frequencies and proof from the linguistic analysis given by a syntactic parser. The indexer weighs the expressions of a document as per their assessed significance for depicting the topic of a given document based on the content analysis.

查看原文本刊更多论文

智能索引和排序管理系统-各种主题的自动搜索索引和排序

大多数数据库面临的一个问题是索引活动的静态和手动特征。这种对不同主题进行索引和排序的传统方法被证实会在一定程度上影响数据集的性能，造成停机时间和对演示文稿的潜在影响，通常可以通过手动索引操作来解决。许多数据挖掘方法都可以通过使用适当的索引结构来加速这一过程。选择合适的索引通常取决于算法对数据集执行的操作类型。主题索引是识别文档所涵盖的主要主题的操作。这些词之所以有用，是因为它们可以作为图书馆的主题标题、学术文章的关键词，以及社交媒体平台上的标签。了解文档的主题可以帮助人们快速确定其重要性。在任何情况下，手动分配主题都是一项乏味且冗余的任务。本文展示了自动创建索引的最佳方法，这种方法可以与人工索引相媲美。本文还讨论了在大量文档中识别适用数据的问题和技术。本文对这个问题的贡献在于培养更好的内容分析技术，这些技术可用于用自动索引术语描述文档内容。索引项可以用作定义文档的元数据，并用于查找各种主题。本文的主要目的是通过将词频证明和句法分析器给出的语言分析证明结合起来，实现自动索引器对文档主题的分析。索引器根据对描述基于内容分析的给定文档主题的评估意义对文档的表达式进行加权。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Nuclear Engineering International 工程技术-核科学技术

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Information not localized