10.14489/vkit.2024.07.pp.042-053

DOI: 10.14489/vkit.2024.07.pp.042-053

Коновалов В. А.
СПОСОБ МАРКИРОВКИ СВОИХ ЦЕПОЧЕК ВХОЖДЕНИЙ СЛОВОМ В N-СХЕМЕ АЛГОРИТМА МАРКОВА
(c. 42-53)

Аннотация. Рассмотрен способ маркировки своих цепочек вхождений словом в N-схеме алгоритма Маркова, необходимый для обнаружения и классификации λ-пустот в данных источников. Перечислены элементы базы данных, используемой N-схемой алгоритма Маркова для записи классифицированных объектов и морфизмов в топосе. Подвержена критическому анализу идея работы с такой базой данных для маркирования своих цепочек вхождений, поскольку при этом необходим второй проход по данным, хранящимся в базе. Маркирование своих цепочек вхождений в N-схеме алгоритма Маркова можно осуществлять двумя основными способами: номером морфизма и первым словом в цепочке. Показан механизм реализации управления в N-схеме алгоритма Маркова при повторном проходе по данным в базе. Рассмотрен способ маркирования своих цепочек вхождений словом, при котором при первом проходе по данным в базу данных заносится первое слово в цепочке в качестве ее уникального маркера. Исследован механизм «сшивания» цепочек вхождений в топосе, при котором первые слова в сшиваемых цепочках объединяются в кортеж и образуют композиции, уникальные для каждой такой цепочки. Установлено, что для обнаружения λ-пустот необходимо перерабатывать все множество таких кортежей. В этих целях таковые придется поместить либо в память, либо в файл, либо в базу данных, так или иначе реализовывая алгоритм переработки этих кортежей вторым проходом по данным. Показано, что длина маркеров, образованных словами, может быть значительной, поэтому целесообразно свернуть таковые относительно какого-либо морфизма, например хеш-функции, синтезировав топос хешей. Таким образом целесообразно исследовать свойства топоса хешей с использованием алфавита M N-схемы алгоритма Маркова для кодирования вхождений относительно предыдущих, а также синтезировать способ маркирования своих цепочек вхождений в топосе хешей.

Ключевые слова: алгоритм Маркова; конструктивная математика; теория категорий; теория алгоритмов; искусственный интеллект.

Konovalov V. A.
THE METHOD OF MARKING ITS CHAINS OF OCCURRENCES OF A WORD IN THE N-SCHEME OF THE MARKOV ALGORITHM
(pp. 42-53)

Abstract. In order to detect and classify -voids in these sources, a method of labeling its chains of occurrences with a word is investigated using the N-scheme of the Markov algorithm. The keys of the database used by the N-schema of the Markov algorithm to record the classified objects and morphisms in the topos are considered as input data for the algorithm that performs the second pass through the data for classifying -voids. The idea of a second pass through the data, using the database, is subjected to critical analysis, in terms of the cost of its implementation and the search for ways to build an algorithm that does not require a second pass through the data. Marking your chains of occurrences in the N-scheme of the Markov algorithm can be done in two main ways, based on the morphism number and the first word in the chain. Marking your chains of occurrences in the N-scheme of the Markov algorithm can be done in two main ways, based on the morphism number and the first word in the chain. This property distinguishes the N-scheme of the Markov algorithm from other algorithms. The control implementation mechanism in the N-scheme of the Markov algorithm is considered for the case of organizing a repeated pass through the data in the database. The method proposed in the study for marking one’s chains of occurrences with a word is considered, in which, during the first pass through the data, the first word in the chain is entered into the database as its unique marker. The mechanism of “stitching” of chains of occurrences in a topos is studied with a different composition of words in chains, a case is highlighted in which the first words in the “stitched” chains are combined into a tuple and form compositions that are unique for each such chain and can be used to mark them. In the course of the study, it was found that in order to detect -voids, it is necessary to process the entire set of such tuples, so they will either have to be placed in memory, or in a file, or in a database, thus, one way or another, implement the algorithm for processing these tuples second pass through the data. Some negative aspects of this approach have been established, for example, the length of markers formed by words can be significant, so it is advisable to collapse them with respect to some function, for example, a hash, synthesizing topos hashes for all given data. It is concluded that it is expedient to study the properties of the hash topos using the alphabet of the M N-scheme of the Markov algorithm for encoding occurrences relative to the previous ones and to synthesize a method for marking own chains of occurrences in the hash topos.

Keywords: Markov algorithm; Constructive mathematics; Category theory; Theory of algorithms; Artificial Intelligence.

+ - Информация об авторах (About the Authors) Click to collapse

Рус

В. А. Коновалов (ООО «Курский мясоперерабатывающий завод», Железногорск, Курская обл., Россия) E-mail: Этот e-mail адрес защищен от спам-ботов, для его просмотра у Вас должен быть включен Javascript

Eng

V. A. Konovalov (LLC “Kurskiy Myasopererabatyvayushij Zavod”, Zheleznogorsk, Kurskaya oblast, Russia) E-mail: Этот e-mail адрес защищен от спам-ботов, для его просмотра у Вас должен быть включен Javascript

+ - Библиографический список (References) Click to collapse

Рус

1. Коновалов В. А. Классификатор объектов больших данных (big data) социально-экономической системы // Вестник компьютерных и информационных технологий. 2022. Т. 19, № 2. C. 32 – 39. DOI: 10.14489/vkit.2022.02.pp.032-039
2. Коновалов В. А. Ассоциаторы в N-схеме алгоритма Маркова // Вестник компьютерных и информационных технологий. 2023. Т. 20, № 2. C. 44 – 53. DOI: 10.14489/vkit.2023.02.pp.044-053
3. Коновалов В. А. Уточнение алфавита Маркова для представления синонимов и омонимов, обозначающих объекты больших данных социально-экономических систем. Часть 1 // Вестник компьютерных и информационных технологий. 2022. Т. 19, № 8. C. 37 – 48. DOI: 10.14489/vkit.2022.08. pp.037-048
4. Коновалов В. А. Использование алгоритмов Маркова для исследования -пустот в больших данных социально-экономических систем. Часть 1 // Вестник компьютерных и информационных технологий. 2022. Т. 19, № 6 C. 29 – 39 DOI: 10.14489/vkit.2022.06.pp.029-039
5. Коновалов В. А. Использование алгоритмов Маркова для исследования -пустот в больших данных социально-экономических систем. Часть 2 // Вестник компьютерных и информационных технологий. 2022. Т. 19, № 7. C. 30 – 41. DOI: 10.14489/vkit.2022.07.pp.030-041
6. Марков А. А., Нагорный Н. М. Теория алгорифмов. М.: Наука, 1984. 432 с. (Мат. логика и основания математики). 2-е изд., испр. и доп. М.: Фазис, 1996. 493 с.
7. ГОСТ Р ИСО/МЭК 20546–2021. Информационные технологии. Большие данные. Обзор и словарь. М.: Стандартинформ, 2021. 21 с.

Eng

1. Konovalov V. A. (2022). Сlassifier of big data objects of the socio-economic system. Vestnik komp'yuternyh i informatsionnyh tekhnologiy, 19(2), 32 – 39. [in Russian language] DOI: 10.14489/vkit.2022.02.pp.032-039
2. Konovalov V. A. (2023). Associators in the N -scheme of the markov algorithm. Vestnik komp'yuternyh i informatsionnyh tekhnologiy, 20(2), 44 – 53. [in Russian language] DOI: 10.14489/vkit.2023.02.pp.044-053
3. Konovalov V. A. (2022). Refinement of the Markov alphabet to represent synonyms and homonyms denoting objects of big data of socio-economic systems. Part 1. Vestnik komp'yuternyh i informatsionnyh tekhnologiy, 19(8), 37 – 48. [in Russian language] DOI: 10.14489/vkit.2022.08. pp.037-048
4. Konovalov V. A. (2022). The use of Markov algorithms for the study of -voids in big data of socio-economic systems. Part 1. Vestnik komp'yuternyh i informatsionnyh tekhnologiy, 19(6), 29 – 39. [in Russian language] DOI: 10.14489/vkit.2022.06.pp.029-039
5. Konovalov V. A. (2022). The use of Markov algorithms for the study of -voids in big data of socio-economic systems. Part 2. Vestnik komp'yuternyh i informatsionnyh tekhnologiy, 19(7), 30 – 41. [in Russian language] DOI: 10.14489/vkit.2022.07.pp.030-041
6. Markov A. A., Nagorniy N. M. (1984). Theory of algorithms. Moscow: Nauka. (Mathematical logic and foundations of mathematics). 2nd ed. Moscow: Fazis. [in Russian language]
7. Information Technology. Big data. Review and Dictionary. (2021). National Standard No. GOST R ISO/MEK 20546–2021. Moscow: Standartinform. [in Russian language]

+ - Заказать электронную версию статьи (Purchase digital version of a single article) Click to collapse

Рус

Статью можно приобрести в электронном виде (PDF формат).

Стоимость статьи 500 руб. (в том числе НДС 20%). После оформления заказа, в течение нескольких дней, на указанный вами e-mail придут счет и квитанция для оплаты в банке.

После поступления денег на счет издательства, вам будет выслан электронный вариант статьи.

Для заказа скопируйте doi статьи:

10.14489/vkit.2024.07.pp.042-053

и заполните форму

Отправляя форму вы даете согласие на обработку персональных данных.

Eng

This article is available in electronic format (PDF).

The cost of a single article is 500 rubles. (including VAT 20%). After you place an order within a few days, you will receive following documents to your specified e-mail: account on payment and receipt to pay in the bank.

After depositing your payment on our bank account we send you file of the article by e-mail.

To order articles please copy the article doi:

10.14489/vkit.2024.07.pp.042-053

and fill out the form