当前位置: 首页 > 文章 > 基于Hadoop分布式文件系统的商业银行大数据分析 山东农业大学学报(自然科学版) 2018,49 (5) 884-888
Position: Home > Articles > Big Data Analysis of Commercial Banks Based on Hadoop Distributed File System Journal of Shandong Agricultural University(Natural Science Edition) 2018,49 (5) 884-888

基于Hadoop分布式文件系统的商业银行大数据分析

作  者:
张登耀
关键词:
Hadoop文件;商业银行;大数据
摘  要:
针对当前Hadoop分布式文件系统数据分析时存在的数据读取时间长,数据本地化率低等问题,本文提出了一种基于Hadoop分布式文件系统的商业银行大数据分析方法。首先对Hadoop分布式文件系统的工作原理和流程进行分析,找到引起不足的原因,然后根据商业银行大数据的特点,对Hadoop分布式文件系统的数据副本数量和数据分布位置进行相应的改进,最后通过仿真模拟实验对数据读取速度、本地化率、磁盘负载等进行分析。结果表明,本方法可以有效减少数据读取时间、提升数据本地化率并均衡磁盘负载,整体性能要明显优于对比方法,具有更好的实际应用价值。
译  名:
Big Data Analysis of Commercial Banks Based on Hadoop Distributed File System
作  者:
ZHANG Deng-yao;School of Finance/Dongbei University of Finance and Economics;
关键词:
Hadoop file;;commercial bank;;big data
摘  要:
In view of the problems of long data reading time and low localization rate of data in the data analysis by Hadoop distributed file system, this paper proposed a large data analysis method for commercial banks based on Hadoop distributed file system. Firstly, the working principle and process of the Hadoop distributed file system were analyzed and the reasons for the shortage were found. Then, according to the characteristics of the big data of the commercial bank, the number of data copies and the data distribution position of the Hadoop distributed file system were improved. Finally, the data reading speed was passed through the simulation simulation experiment. The localization rate, the disk load and so on were analyzed. The results showed that the method could effectively reduce the time of data reading, improve the localization rate of data and balance the load of the disk. The overall performance was better than the contrast method, and it has a better practical application value.
计量
文章访问数: 9
HTML全文浏览量: 0
PDF下载量: 0

所属期刊