1. DES SURYANI - Doctoral Program of Technology and Vocational Education, Faculty of Engineering - Universitas Negeri Padang, Indonesia & Study Program of Informatics Engineering, Faculty of Engineering - Islamic University of Riau, Indonesia
2. AMBIYAR - Doctoral Program of Technology and Vocational Education, Faculty of Engineering - Universitas Negeri Padang, Indonesia.
3. ASRUL HUDA - Doctoral Program of Technology and Vocational Education, Faculty of Engineering - Universitas Negeri Padang, Indonesia.
4. WILDA SRIHASTUTY HANDAYANI PILIANG - Study Program of Indonesian Language, Faculty of Education - Islamic University of Riau, Indonesia.
5. RIKA MELYANTI - Doctoral Program of Technology and Vocational Education, Faculty of Engineering - Universitas Negeri Padang, Indonesia & Study Program of Information System, STMIK Hang Tuah Pekanbaru, Indonesia.
6. FITRI AYU - Doctoral Program of Technology and Vocational Education, Faculty of Engineering - Universitas Negeri Padang, Indonesia & Study Program of Information Management, AMIK Mahaputra Riau, Indonesia.
Indonesian is the state language and the official language of the Unitary State of the Republic of Indonesia. The Indonesian that is used must be standard Indonesian and must follow good and correct Indonesian spelling rules. In practice, errors in the use of the standard language are still common in the development of national culture, science, and technology. One example is the use of non-standard words in scientific writing in Indonesian. This happens because writers lack mastery of standard vocabulary. It can also result from the habit of picking up language from one's surroundings without filtering it first. Commonly used forms are assumed to be correct, and people rarely look into their origin or meaning. As a result, incorrect Indonesian usage is passed down for generations. Given this phenomenon, research is needed to build a non-standard word shortening system using the Python programming language and the Approximate String Matching method. The system matches the words in a bag-of-words (TB) of non-standard words against the abstract text. The sample data are abstracts taken from texts (reports, papers, scientific works), provided that each abstract does not exceed 200 words. The resulting system can find and count the non-standard words contained in one or several abstracts and replace them with standard words.
standard words, non-standard words, bag-of-words, approximate string matching
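The matching step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the word pairs in `NONSTANDARD_TO_STANDARD`, the function name `correct_abstract`, and the `cutoff` value are assumptions made for illustration, and the similarity ratio from the standard-library `difflib` module stands in for the Approximate String Matching method, whose exact algorithm is not specified here.

```python
import re
from difflib import get_close_matches

# Hypothetical bag-of-words: non-standard form -> standard form.
# A real system would load a much larger list.
NONSTANDARD_TO_STANDARD = {
    "aktifitas": "aktivitas",
    "analisa": "analisis",
    "resiko": "risiko",
    "praktek": "praktik",
    "sistim": "sistem",
}


def correct_abstract(text, mapping=NONSTANDARD_TO_STANDARD,
                     cutoff=0.85, max_words=200):
    """Return (corrected_text, found_words) for one abstract.

    cutoff is an assumed similarity threshold in [0, 1]; max_words
    mirrors the 200-word limit stated in the abstract.
    """
    if len(re.findall(r"\w+", text)) > max_words:
        raise ValueError(f"abstract exceeds {max_words} words")

    standard = set(mapping.values())
    candidates = list(mapping)
    found = []

    def fix(match):
        word = match.group(0)
        key = word.lower()
        # Words that are already standard are left untouched, even if
        # they are very similar to a non-standard entry.
        if key in standard:
            return word
        close = get_close_matches(key, candidates, n=1, cutoff=cutoff)
        if not close:
            return word
        found.append(word)
        replacement = mapping[close[0]]
        # Keep an initial capital, e.g. at the start of a sentence.
        return replacement.capitalize() if word[0].isupper() else replacement

    corrected = re.sub(r"\w+", fix, text)
    return corrected, found


if __name__ == "__main__":
    sample = "Analisa resiko pada sistim ini mendukung aktivitas praktek."
    corrected, found = correct_abstract(sample)
    print(corrected)
    print(len(found), "non-standard words:", found)
```

Because the comparison is approximate rather than exact, a slightly misspelled form such as "aktifitass" is still mapped to "aktivitas". The threshold trades missed variants against false matches, so it would need tuning on real abstracts.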