Not found

"Big Data" Applications

       地球科学领域中,大数据的应用正日益成为研究的核心。随着遥感技术、地质勘探和环境监测技术的进步,科学家们能够收集到大量关于地球的数据,包括气候模式、地质结构、生态系统变化等。这些数据的规模、多样性和复杂性要求采用先进的数据分析技术来揭示其中的模式和趋势。 
       大数据在地球科学中的应用包括气候模型的建立、自然灾害预警系统、资源勘探和环境影响评估等方面。例如,通过对历史气候数据的分析,科学家可以预测未来的气候变化趋势,并为应对极端天气事件制定策略。在环境监测中,大数据分析有助于识别污染源和评估人类活动对生态系统的影响。此外,地质勘探中的大数据技术可以提高矿产资源的探测精度,优化勘探计划。 

       大数据在地球科学中的应用不仅推动了科学发现,也为全球可持续发展提供了强有力的技术支持。

Please wait a minute...
  • Select all
    |
  • Jiaqi YAO, Huanyu CHANG, Mengran WANG, Min CHEN, Fan MO, Nan XU, Zhen WEN, Yongqiang CAO
    Advances in Earth Science. 2024, 39(4): 374-390. https://doi.org/10.11867/j.issn.1001-8166.2024.027

    Hydrological and water resource monitoring are pivotal components of Earth observation systems, crucial for supporting the high-quality development of water conservancy in the modern era, fulfilling the requirements of “three water” co-governance, and implementing the “sixteen words” water-control strategy. Satellite remote sensing offers a scalable, rapid, and high-precision data acquisition pathway. Nonetheless, challenges persist in the application of existing satellite remote sensing in hydrology and water resources, including difficulties in achieving multi-satellite synchronous observation, limited emergency response capability, and susceptibility to adverse weather conditions. In December 2022, NASA launched the Surface Water and Ocean Topography (SWOT) satellite, the first satellite in the world designed to observe global land and ocean water resources through multisensor collaboration. This groundbreaking satellite greatly improves the spatial and temporal resolution and accuracy of hydrology and water resource monitoring. This study systematically reviews the development status, applications, and technical challenges of hydrological and water resource monitoring satellites. It also analyzes the satellite parameters, scientific tasks, algorithm flow, and application products of SWOT, providing a valuable reference for future satellite design planning and key data processing technologies, especially in China.

  • Chunlin HUANG, Jinliang HOU, Weide LI, Juan GU, Ying ZHANG, Weixiao HAN, Weizhen WANG, Xiaohu WEN, Gaofeng ZHU
    Advances in Earth Science. 2023, 38(5): 441-452. https://doi.org/10.11867/j.issn.1001-8166.2023.022

    Data-driven methods with deep learning as their core have been gradually applied in Earth science; however, challenges remain regarding the interpretability of models and physical consistency. With the background of remote sensing big data, combining deep learning and data assimilation methods to develop new techniques for the simulation and prediction of terrestrial water cycle processes has become an important research direction in Earth science. Τhe progress in deep learning in recent years combines improving the quality of observation data of terrestrial water cycle components and reducing the uncertainty of physical models. Furthermore, the key scientific issues regarding data assimilation in terrestrial hydrology based on deep learning fusing remote sensing big data are classified according to the observations, physical models, and system integration: How can the temporal and spatial representativeness of samples be enhanced when deep learning inverts remote sensing products? How can a new physics-guided deep learning method be developed within the framework of data assimilation? How can the predictability of the terrestrial water cycle be improved through the “data-model” dual drive? Relevant research and exploration should help promote the in-depth application of the “data-model” hybrid modeling method in the field of hydrology and improve the simulation and prediction capacity of the terrestrial water cycle process.

  • Kai WANG, Shaojie ZHANG, Juan MA, Hongjuan YANG, Dunlong LIU, Chaoping YANG
    Advances in Earth Science. 2022, 37(10): 1054-1065. https://doi.org/10.11867/j.issn.1001-8166.2022.042

    Identification of the landslide deformation stage is a key aspect of landslide warning systems. However, many displacement curves lack the obvious characteristics of the three stages, making it challenging to identify the deformation stages of landslides. To overcome the defects of the Saito method, we proposed a method to extract the deformation stages based on a morphology analysis. A total of 1944 deformation stages were identified from the GNSS surface-displacement monitoring database to form a big data environment. Then, we analyzed the spatial distribution of the displacement stages of various lithologic zones in China and used cluster analysis to develop multi-stage warning criteria for deformation stages. The spatial probability and duration of each warning level were also discussed in this study. Analysis results indicated that the power law adequately described the distribution of daily deformation rate for each lithologic region, with daily deformation rates below 10 mm/d accounting for the majority of observations. Cluster analysis revealed that the number of deformation stages within the “none”, “blue”, “yellow”, “orange”, and “red” warning levels also followed the power rule for each lithologic region. It should be noted that the warning threshold of the deformation rate decreased from lithology 1 to lithology 4 at the same warning level. According to the duration analysis of the deformation stage, the longest-lasting daily rate was 0.30~2.14 mm/d with a duration of 120.22~160.96 days; whereas the shortest-lasting daily rate was 2 734.18~31 770.00 mm/d with a duration of 0.004 3~0.020 0 days. In this study, we analyzed the spatial distribution and early-warning criteria of landscape development stages in a big data environment, which could provide a scientific foundation and direction for the development of an intelligent landslide warning model.

  • Haodi WANG, Shiyao CHEN, Senliang BAO, Kaijun REN
    Advances in Earth Science. 2022, 37(8): 822-840. https://doi.org/10.11867/j.issn.1001-8166.2022.044

    Marine science is a discipline developed on the basis of continuous observation data accumulation, and major breakthroughs in marine science development history are inseparable from the updating of ocean datasets. The time extension of existing gridded datasets has become the main form of updating ocean data products. This review summarizes the current development of gridded marine environment datasets. First, the historical development of ocean observations is divided into three stages: an initial accumulation period dominated by sparse observations, a rapid growth period guided by international observation programs, and a high-quality development period driven by data assimilation and ocean reanalysis. Starting from the three key elements of temperature, salinity, and ocean current, we focus on the global gridded ocean environment datasets published and updated internationally in recent decades, including six flow field datasets, such as HYCOM and OFES, and ten thermohaline datasets, such as Argo and IAP. Based on previous studies, the sources, characteristic information, advantages, and disadvantages of these datasets are briefly reviewed to provide a reference for marine scientists. Finally, the future development direction and research focus of ocean gridded datasets are discussed.

  • Bingquan WANG, Youhua RAN
    Advances in Earth Science. 2021, 36(11): 1137-1145. https://doi.org/10.11867/j.issn.1001-8166.2021.120

    The maximum freezing depth is an important indicator for the thermal state of seasonally frozen ground, and its changes have an important impact on the regional water cycle, ecological processes and engineering stability. This paper released a soil maximum freezing depth grid dataset for 10-year period from 1961 to 2020 in Northwest China and Tibet, with a spatial resolution of 1 km. The dataset was produced by integrating downscaled and bias corrected weather data, elevation and soil properties using a support vector machine model with 200 ensemble simulations. The 10-fold cross-validation shows that the accuracy of the support vector machine model is acceptable [R2 = 0.70 ± 0.29, RMSE = (23.63 ± 10.30) cm, bias = (-0.77 ± 6.01) cm]. Validation using in-situ data shows that the R2 for the four periods 1980s, 1990s, 2000s and 2010s are 0.77, 0.83, 0.73 and 0.71 respectively, and the RMSE are 27.14 cm, 22.42 cm, 21.63 cm and 23.58 cm respectively. The uncertainty of the simulation results is stable throughout the simulation period. Based on this dataset, we found that the soil maximum freezing depth in the Northwest China and Tibet decreased significantly between 1960s and 2020s, with an average rate of 3.02 cm per decade. The dataset can be downloaded via the National Tibetan Plateau/Third Pole Environment Data Center (DOI: 10.11888/Geocry.tpdc.271774).

  • Pengfei MA,Zhifei LIU,Shouting TUO,Jingxin JIANG,Yiwei XU,Xiumian HU
    Advances in Earth Science. 2021, 36(6): 643-662.

    Since the 1960s, 297 expeditions carried out by Scientific Ocean Drilling have recovered more than 4×105 meters of cores and collected a large amount of data. However, the data based on core sample testing and direct observation are scattered in different documents and databases in a heterogeneous format, which hinders the extensive sharing and effective use of them. Through systematic investigation of the official reports, databases, and academic papers of the four stages of Scientific Ocean Drilling, we show the present status of these scientific data including their sources, storage media, and specific types. The data of Scientific Ocean Drilling mainly consist of shipboard and post-expedition results. Nearly 200 data types classified into 15 categories including coring summary, lithostratigraphy, sedimentology and mineralogy, paleontology, stratigraphy, geochemistry, structural geology, geophysics, etc. are recorded in tables, figures, and texts. We find that these scientific data are hierarchically organized and have specific spatial-temporal properties; their sources are both straightforward and complex with different storage formats; and data types of different sites are not only consistent but also diverse. The data of Scientific Ocean Drilling are typical scientific big data in the field of earth sciences. Meanwhile, it is believed that the compilation of the Scientific Ocean Drilling Data would be of significance for fast data acquisition and future studies. This step not only has the potential to solve major scientific problems in the field of Earth science such as long-term biological evolution, global material circulation, paleoclimate and paleoceanography, and deep-sea mineral resource evaluation, but may contribute to the transformation of scientific research paradigm as well. Meanwhile, a proposal for data format unification and data compilation is also presented here.

  • Yong Wei,Qiang Xu,Zhuo Wang,Huajin Li,Songlin Li
    Advances in Earth Science. 2020, 35(10): 1087-1098. https://doi.org/10.11867/j.issn.1001-8166.2020.083

    Model test is widely accepted and used in the field of civil engineering, mining engineering and earth sciences, etc. At present, the static terrain data are measured before and after each experiment by terrestrial laser scanning, however it is necessary to explore new technology to obtain dynamic terrain data in the course of the experiment. By taking the specified experimental tests of debris avalanche as an example, the method of 4D reconstruction based on dynamic photogrammetry was described in detail. The dynamic terrain data of the model test were obtained after the data had been processed, and then the propagation and deposit features of debris avalanche were analyzed in detail. The results show that the dynamic terrain data of the model test can be obtained accurately with the method, and the interpretation of the propagation and deposit should be relatively easy by analyzing the data of model test in detail. This is not only the new technology applied in the document of the dynamic terrain of the model test, but also causes a great change for the experimental analysis, and it deserves to be applied widely.

  • Juanle Wang,Lei Shi,Yujie Wang,Mengxu Gao,Bo Xu,Chao Wang,Mingming Wang,Yanjie Wang,Yezhi Zhou
    Advances in Earth Science. 2020, 35(8): 839-847. https://doi.org/10.11867/j.issn.1001-8166.2020.070

    Scientific data are strategic resources, and the aggregation of scientific data is an important method to seize the upstream and competitive highlands of scientific data. Notably, it is challenging to grasp the international situation and the scientific laws concerning the mode of scientific data aggregation; exploring the modes and methods of scientific data aggregation that are suitable for China's national conditions is also difficult. This paper investigated and analyzed the modes of scientific data aggregation both at home and abroad from the viewpoints of international organizations, international scientific programs, government agencies, and professional data centers. Five modes of scientific data aggregation were summarized, including scientific research projects converging to designated data centers/repositories, scientific research projects dispersing to data centers/repositories, individual scientists submitting datasets to data centers/repositories with published papers, scientific research projects/individual scientists sharing directories/networks, big data computing/processing platform, and citizen science models of open and public convergence. This paper analyzed each mode and the corresponding cases. On this basis, the paper put forward six suggestions for the reasonable aggregation of scientific data in China, including the implementation of the “Measurement of Scientific Data Management”, certification of data aggregation centers, scientific data collection and publishing in journals, construction of data aggregation networks, aggregation of international resources, and construction of the whole data aggregation chain.

  • Ling Zhang, Ping Wang, Xiyun Chen, Yong Yin
    Advances in Earth Science. 2020, 35(4): 414-430. https://doi.org/10.11867/j.issn.1001-8166.2020.030

    The U-Pb chronology of detritus zircon is an important method to explore sediment provenance, which is widely used in sedimentology, geotectonics, geomorphology and other fields. This paper reviewed the recent progress of the U-Pb chronology of detrital zircon from three aspects: data acquisition, analysis and comparison. In terms of data acquisition, the sample preparation method, isotope age data selection and test quantity were expounded from the basic principle; In terms of data analysis, the data visualization methods of Probability Density Plot (PDP), Kernel Density Estimate (KDE) and Cumulative Age Distribution (CAD) were compared; In terms of data comparison, the basic algorithm and application advantages of quantitative comparison were analyzed with examples, including (dis)similarity measures based on non-parametric hypothesis tests (K-S test), (dis)similarity measures based on age spectrum comparison (Cross-correlation coefficients) and (dis)similarity measures based on Multi-Dimensional Scali (MDS). Finally, three commonly used software tools were introduced. Suggestions were given in terms of data acquisition, analysis and comparison for future research.

  • Chunguan Zhang,Xiang Li,Bingqiang Yuan,Lijun Song
    Advances in Earth Science. 2019, 34(3): 288-294. https://doi.org/10.11867/j.issn.1001-8166.2019.03.0288

    In order to evaluate the quality of the offshore magnetic data in the Earth Magnetic Anomaly Grid (2-arc-minute resolution)(EMAG2), the authors chose the aeromagnetic data at 1∶500 000 and 1∶1 000 000 scales of the southern section of the Kolbeinsey Ridge with a total area of 193 500 km2 to compare and analyze. Based on the EMAG2 data, the authors obtained the EMAG2 (downward continuation 4 km) and the aeromagnetic anomaly (upward continuation 4 km) using the analytical continuation method. Then, the correlation coefficients between the aeromagnetic anomaly (upward continuation 4 km) and the EMAG2, and the aeromagnetic anomaly and the EMAG2 (downward continuation 4 km) were calculated by the correlation analysis method. Finally, through comprehensive analysis of the features of these correlation coefficients and differences, the quality of the magnetic data of the southern section of the Kolbeinsey Ridge was evaluated in the database EMAG2. The results showed that the EMAG2 integrated a large number of the airborne or offshore magnetic data. The quality of the offshore magnetic data is relatively high in the offshore areas with dense lines. However, the quality of the offshore magnetic data is relatively low if the EMAG2 data at 4 km altitude is converted to the anomaly data at sea level.

  • Juanle Wang,Mingming Wang,Lei Shi,Mengxu Gao,Mingqi Chen,Xiaohuan Zheng,Chao Wang,Yujie Wang
    Advances in Earth Science. 2019, 34(3): 306-315. https://doi.org/10.11867/j.issn.1001-8166.2019.03.0306

    Scientific data is an important scientific and technological basis and national strategic resource. With the coming of the era of big data, countries all over the world have added scientific data management into their national development strategies. In order to improve the scientific data management in our country and make full use of the development opportunity from big data, the General Office of the State Council officially issued “The Measures for the Administration of Scientific Data” in March, 2018. In the new international and domestic policy on data management, this study analyzed and summarized 11 aspects of international situation of scientific data management, and dissected the development models of developed countries’ scientific data centers deeply. And also, the study put forward new ideas for the policy-making of scientific data, the construction of scientific data centers, the life cycle management of scientific data, the capacity building of data organizations, the authentication of scientific data, the publication of data, the return mechanism of the data and the comprehensive integration, and the safety management of the data in the scientific data management of Earth sciences in China.

  • Yongtao Li, Jianwen Li, Lin Pan, Liangliang Guo, Rongrong Wei, Dezhi Liu
    Advances in Earth Science. 2018, 33(11): 1161-1168. https://doi.org/10.11867/j.issn.1001-8166.2018.11.1161.

    With the successful launch of the sixteen MEO satellites of the Beidou-3 global satellite navigation system and the broadcast of new signals, Beidou has officially entered the global construction stage while the international GNSS Monitoring and Assessment System (iGMAS) is also performing systematic testing and evaluation on various aspects of operational performance from satellite end to ground receiving end of Beidou-3 system. This paper analyzed and evaluated the observation quality of new signals B1C and B2a broadcasted by the twelve new Beidou-3 MEO satellites on the observation data integrity rate, multipath error, pseudorange noise and Carrier-to-Noise Ratio (CNR) compared with GPS and GALILEO. The results show that the observation data integrity rate of B1C signal is better than that of B2a signal in the Beidou-3 system. In the aspects of multipath error, pseudorange noise and CNR, B2a signal is better than that of B1C. The performances of the twelve MEO satellites of Beidou-3 are equivalent, that is, for Beidou-3, the consistency of satellites can be guaranteed. In terms of pseudo-range noise, Beidou-3 is slightly worse than GPS and GALILEO While the observation data integrity rate, multipath error and CNR of Beidou-3 are equivalent to those of GPS and GALILEO.

  • Jinglun Mao, Yiqing Zhu
    Advances in Earth Science. 2018, 33(3): 236-247. https://doi.org/10.11867/j.issn.1001-8166.2018.03.0236

    The mechanism of earthquake inoculation and the process of earthquake occurrence are very complicated. Additionally, earthquakes do not happen very often, and we lack enough cognition to the earth’s interior structure, activity regularity and other key elements. As a result, research progress about the theory of earthquake precursors has been greatly restricted. Ground gravity observation has become one of the main ways to study earthquake precursor information in many countries and regions. This paper briefly summarized the surface gravity observation technology and observation network in China: the surface gravity measurement instrument developed from Huygens physical pendulum in seventeenth Century to today’s high-precision absolute gravimeter, and its accuracy reached to ±1×10-8 m/s2. China has successively established the National Gravity Network, Digital Earthquake Observation Network of China,the Crustal Movement Observation Network of China Ⅰ and the Crustal Movement Observation Network of China, to provide a public platform for monitoring non tidal gravity change, seismic gravity and tectonic movement. The use of specific examples illustrated the role of gravity observation data in earthquake prediction. The gravity observation data of ground gravity can be used to capture the information of gravity change in the process of strong earthquake inoculation, and to provide an important basis for the long-term prediction of strong earthquakes. The temporal and spatial variation characteristics of the regional gravity field and its relation to strong earthquakes were analyzed: Before the earthquake whose magnitude is higher than MS 5, generally there will be a large amplitude and range of gravity anomaly zones. Strong earthquakes occur mainly in areas where the gravity field changes violently. The dynamic change images of gravity field can clearly reflect the precursory information of large earthquakes during the inoculation and occurrence. Finally, the existing problems of surface gravity technology in earthquake precursor observation were put forward and the use of gravity measurement data in earthquake prediction research was prospected.