Key Technologies and Applications of Geoscience Big Data

Authors

  • Yanbin Zhou School of Resources and Environment, Henan Polytechnic University, Jiaozuo 454003, China

DOI:

https://doi.org/10.54097/92awyg79

Keywords:

Geoscience big data, Hadoop, MapReduce, cloud server, Python, PostgreSQL, MongoDB, geoscience application

Abstract

Geoscience big data refers to massive, multi-source, heterogeneous, and spatiotemporally associated datasets generated in earth science research and engineering practice. It covers geological, geographical, remote sensing, meteorological, hydrological, environmental, and resource-related data. With the development of remote sensing observation, GIS, Internet of Things monitoring, cloud computing, and artificial intelligence, geoscience data are increasing continuously in scale, type, and update frequency. Conventional stand-alone data processing methods have gradually become insufficient in terms of storage capacity, computing efficiency, spatial query, and integrated analysis. Based on a course report on geoscience big data, this paper reorganizes the concepts, technical framework, implementation process, and application scenario of geoscience big data in the form of an academic paper. First, the data sources, basic characteristics, and processing requirements of geoscience big data are analyzed. Then, a technical framework for geoscience data processing is constructed around cloud servers, Linux operating environments, Hadoop/MapReduce, MongoDB, PostgreSQL/PostGIS, and Python batch processing. On this basis, practical operations, including server connection, file management, MapReduce-based column statistics, and Java program compilation and execution, are summarized. The study indicates that geoscience big data technologies can improve the organization, storage, computation, and analysis efficiency of multi-source geoscience data and provide technical support for typical applications such as WebGIS-based disaster monitoring and early warning. However, further application still needs to address issues such as inconsistent data standards, insufficient data quality control, complex system integration, and data security protection.

Downloads

Download data is not yet available.

References

[1] Li Z B, Chen J P, Wang D C. Development and research status of geoscience big data technologies in China and the United States [J]. Journal of Geology, 2020, 44(04): 345-355. (in Chinese)

[2] Peng J B, Li Z H. Can geoscience big data contribute to geological hazard prediction? [J]. Earth Science, 2022, 47(10): 3900-3901. (in Chinese)

[3] White T. Hadoop: The Definitive Guide[M]. 4th ed. Sebastopol: O'Reilly Media, 2015.

[4] Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters [C]//Proceedings of the 6th Symposium on Operating Systems Design and Implementation. 2004: 137-150.

[5] Ghemawat S, Gobioff H, Leung S T. The Google File System [C]//Proceedings of the 19th ACM Symposium on Operating Systems Principles. 2003: 29-43.

[6] Chang F, Dean J, Ghemawat S, et al. Bigtable: A Distributed Storage System for Structured Data [J]. ACM Transactions on Computer Systems, 2008, 26(2): 1-26.

[7] Obe R O, Hsu L S. PostGIS in Action [M]. 2nd ed. Shelter Island: Manning Publications, 2015.

Downloads

Published

2026-05-15

Issue

Section

Articles

How to Cite

Zhou, Y. (2026). Key Technologies and Applications of Geoscience Big Data. International Journal of Advanced Engineering and Technology Research, 2(1), 43-49. https://doi.org/10.54097/92awyg79