Mingyang Song1, Yan Huang2, Guojian Xu1, Zhenghong Jia1*, Li Tang2 and Zhenggang Leng1
1College of Information Science and Engineering, Xinjiang University, Urumqi, China
2Network Department, China Mobile Communications Group Xinjiang Co., Ltd., Urumqi, China
*Corresponding author: Zhenghong Jia, College of Information Science and Engineering, Xinjiang University, Urumqi, China
Submission: February 24, 2023; Published: April 21, 2023
ISSN: 2832-4463 Volume 3 Issue 1
The article proposes a system solution for analyzing logs with big data. It adopts the Hadoop-ecosystem big data processing framework and the Spark compute engine. For data acquisition, it uses Scrapy, the current mainstream web crawling tool, with crawlers supplementing the data we need [1-5]: the company registration and filing data are obtained, compared against the log information in the big data cluster, and these companies' domain-name aliases and IP information are filtered out of the logs. A data warehouse model is built that divides the data at fine granularity from acquisition to analysis and filters it layer by layer; data retrieval and transaction management are optimized, and a standardized dimensional data model is used to tune database performance, so that the database can be searched quickly, the organization of the data warehouse is easier for users to understand and use, and the functional granularities required for daily and weekly analysis are determined [6-9].
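As a minimal sketch of the deduplication and layer-by-layer filtering described above, the DNS logs can be matched in Spark against the registered company domain list. The paths, column names, and the `top55_domains.txt` file below are illustrative assumptions, not the system's actual schema.

```python
# Minimal PySpark sketch: deduplicate raw DNS logs and keep only rows whose
# queried domain falls under one of the registered TOP55 company domains.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("dns-log-filter").getOrCreate()

# Raw DNS logs, e.g. columns: timestamp, client_ip, query_domain, answer_ip
logs = (spark.read.csv("hdfs:///dns/raw/*.csv", header=True)
        .dropDuplicates(["query_domain", "answer_ip"]))   # field-level dedup

# Registered company domains collected by the Scrapy crawler (assumed file)
domains = [d.strip() for d in open("top55_domains.txt") if d.strip()]

# Keep log rows whose query_domain ends with any registered domain
cond = None
for d in domains:
    c = F.col("query_domain").endswith(d)
    cond = c if cond is None else (cond | c)

matched = logs.filter(cond)
matched.write.mode("overwrite").parquet("hdfs:///dns/matched/")
```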
A resale analysis platform is built: resale statistics are displayed through a UI based on the Spring Boot framework, with LayUI and Bootstrap used to design the front-end web pages, Spring Security for security verification, ECharts for data reports, and Ajax for front-end interaction. The backend uses a MySQL database and Python scripts for data analysis [10-13].
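For the Python analysis backend, one hedged possibility is a script that reads resale records from MySQL and aggregates them per company for the report pages. The `resale_records` table, its columns, and the connection string are assumptions, not the system's actual schema.

```python
# Hypothetical backend analysis script: aggregate resale counts per company
# from a MySQL table so the Spring Boot UI can render them with ECharts.
import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("mysql+pymysql://user:password@localhost:3306/resale_db")

# Assumed table: resale_records(company, resale_time)
df = pd.read_sql("SELECT company, resale_time FROM resale_records", engine)

# Resale count per company and the busiest hour of day for each
df["hour"] = pd.to_datetime(df["resale_time"]).dt.hour
summary = (df.groupby("company")
             .agg(resale_count=("resale_time", "size"),
                  peak_hour=("hour", lambda h: h.mode().iat[0]))
             .reset_index())

summary.to_json("resale_summary.json", orient="records")  # consumed by the UI
```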
The contributions are as follows:
A. Obtain the subdomain names registered by the TOP55 companies through Scrapy crawlers to establish the TOP55 customer domain-name information database.
B. Propose an improved generalized suffix automaton algorithm: build a big data platform to deduplicate and clean the DNS log fields, compile the subdomain database into a generalized suffix automaton, feed in the domain-name field of each line of the DNS log, and retrieve the matching domain names and IPs from the log (a simplified matching sketch appears after this list).
C. Propose adding a caching middleware to the Scrapy framework. The Scrapy crawler obtains the company to which each CNAME domain name belongs; before crawling a domain name's attribution, it asks the cache middleware whether that name has already been fetched, which avoids repeatedly crawling the attribution of the same name and greatly reduces the time spent on crawling (see the middleware sketch after this list).
D. Use Python-based pandas matching and consecutive regular-expression matching to map each IP in the logs to its corresponding company (a hedged sketch follows this list).
E. Build the TOP55 customer resale-behavior analysis page platform on Spring Boot, analyze the resale counts and resale times of specific companies, and draw the resale distribution map.
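The full generalized suffix automaton of contribution B is beyond a short example, so the sketch below is a much-simplified stand-in: it builds a trie over reversed domain labels from the TOP55 subdomain database and checks whether each DNS-log domain falls under a registered (sub)domain. All names and example data are illustrative.

```python
# Simplified stand-in for contribution B: a reversed-label domain trie instead
# of the paper's improved generalized suffix automaton.
def build_trie(domains):
    root = {}
    for d in domains:
        node = root
        for label in reversed(d.lower().strip(".").split(".")):
            node = node.setdefault(label, {})
        node["$"] = d                      # mark the end of a registered domain
    return root

def match(trie, domain):
    """Return the registered domain that `domain` falls under, or None."""
    node, hit = trie, None
    for label in reversed(domain.lower().strip(".").split(".")):
        if "$" in node:
            hit = node["$"]
        if label not in node:
            break
        node = node[label]
    else:
        if "$" in node:
            hit = node["$"]
    return hit

trie = build_trie(["mail.example.com", "example.org"])
print(match(trie, "smtp.mail.example.com"))   # -> mail.example.com
print(match(trie, "cdn.other.net"))           # -> None
```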
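For contribution C, a hedged sketch of a Scrapy downloader middleware that caches attribution lookups might look as follows. The class name, the in-memory dict cache, and the `domain` request meta key are assumptions, not the paper's implementation.

```python
# Hypothetical caching middleware: short-circuit requests for CNAME domains
# whose attribution page has already been fetched.
from scrapy.http import HtmlResponse

class AttributionCacheMiddleware:
    def __init__(self):
        self.cache = {}                   # domain -> previously fetched body

    def process_request(self, request, spider):
        domain = request.meta.get("domain")
        if domain in self.cache:
            # Answer from the cache instead of downloading again
            return HtmlResponse(url=request.url, body=self.cache[domain],
                                encoding="utf-8", request=request)
        return None                       # not cached yet; download as usual

    def process_response(self, request, response, spider):
        domain = request.meta.get("domain")
        if domain is not None and domain not in self.cache:
            self.cache[domain] = response.body
        return response
```

The middleware would be enabled through `DOWNLOADER_MIDDLEWARES` in `settings.py`; for a distributed crawl, a shared store such as Redis could replace the in-memory dict.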
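Reading contribution D as matching the IPs extracted from log answer fields back to their owning companies, a hedged pandas-plus-regex sketch is shown below; the column names and example rows are invented for illustration.

```python
# Hedged sketch for contribution D: extract IPv4 addresses from raw answer
# fields with a regular expression, then join them to companies via the domain.
import re
import pandas as pd

ipv4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")

logs = pd.DataFrame({
    "domain": ["mail.example.com", "www.example.org"],
    "answer": ["CNAME cdn.example.com; A 203.0.113.7", "A 198.51.100.24"],
})
attribution = pd.DataFrame({
    "domain": ["mail.example.com", "www.example.org"],
    "company": ["Example Co.", "Example Org"],
})

# Extract every IP in the answer field, one row per (domain, ip) pair
logs["ip"] = logs["answer"].apply(ipv4.findall)
pairs = logs.explode("ip")[["domain", "ip"]]

# Attach the owning company of each IP through its domain
result = pairs.merge(attribution, on="domain", how="left")
print(result)
```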
© 2023 Zhenghong Jia. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and building upon your work non-commercially.