For an up-to-date list and metrics see Google Scholar .
Click here for a quick summary of the journals/conferences where I have published.
Journals:
Applied Geography
Appl Geog -
Applied Geography is a journal devoted to the publication of research which utilizes geographic approaches (human, physical, nature-society and GIScience) to resolve human problems that have a spatial dimension.
Concurrency and Computation: Practice and Experience
C&C:P&E -
Concurrency and Computation: Practice and Experience is a computer science journal publishing original research and review papers on parallel and distributed computing systems. With a broad scope, the journal covers high-performance computing, data science, artificial intelligence and machine learning, big data, security, quantum and cloud computing, and more.
Geographical Analysis
Geog Analysis -
Geographical Analysis publishes geographical theory, model building, and quantitative methods to geographers and scholars in a wide spectrum of related fields.
International Journal of Health Geographics (IJHG)
IJHG -
International Journal of Health Geographics covers a wide range of interdisciplinary geospatial topics in a health/healthcare context, from spatial data infrastructure and Web geospatial interoperability research, to research into real-time Geographic Information Systems (GIS)-enabled surveillance services, remote sensing applications, spatial epidemiology, spatio-temporal statistics, internet GIS and cyberspace mapping, participatory GIS and citizen sensing, geospatial big data, healthy smart cities and regions, and geospatial Internet of Things and blockchain.
International Journal of Geographical Information Science
IJGIS -
International Journal of Geographical Information Science publishes international research in the rapidly growing field of geographical information science (GIScience).
Journal of Artificial Societies and Social Simulation (JASSS)
JASSS -
The Journal of Artificial Societies and Social Simulation is an interdisciplinary journal for the exploration and understanding of social processes by means of computer simulation.
PLOS ONE
PLOS ONE -
PLOS ONE welcomes original research submissions from the natural sciences, medical research, engineering, as well as the related social sciences and humanities.
SoftwareX
SoftwareX -
SoftwareX aims to acknowledge the impact of software on today's research practice, and on new scientific discoveries in almost all research domains.
Transactions in GIS (TGIS)
TGIS -
Transactions in GIS is an international, peer-reviewed journal that publishes original research articles, review articles, and short technical notes on the latest advances and best practices in the spatial sciences. The spatial sciences include all of the different ways in which geography may be used to organize, represent, store, analyze, model and visualize information.
Conferences:
ACM Practice and Experience in Advanced Research Computing (PEARC)
PEARC -
The ACM Practice and Experience in Advanced Research Computing (PEARC) Conference Series is a community-driven effort that unites cyberinfrastructure and research computing practitioners, developers, and users by providing a forum for discussing ideas, findings, techniques, tools, and experiences.
ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (SIGSPATIAL)
SIGSPATIAL -
The ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems 2022 (ACM SIGSPATIAL 2022) began as a series of symposia and workshops starting in 1993 with the aim of bringing together researchers, developers, users, and practitioners in relation to novel systems based on geospatial data and knowledge, and fostering interdisciplinary discussions and research in all aspects of geographic information systems.
American Geophysical Union (AGU)
AGU -
AGU is a global community supporting more than half a million advocates and professionals in the Earth and space sciences.
Sixth International Conference on Dynamics of Disasters
DOD -
The DOD series of conferences aims to bring together researchers, practitioners, and policy-makers to discuss and advance topics of interest including (but not limited to) optimization, computing, mathematical and statistical modeling, data analytics, emergency and disaster management.
Gateways
Gateways -
Science gateways allow academic research or education communities to access shared data, software, computing services, instruments, educational materials, and other resources specific to their disciplines. They are typically a web portal or a suite of desktop applications.
International Conference on Geographic Information Science
GIScience -
GIScience is the flagship conference in the field of Geographic Information Science.
Institute for Geospatial Understanding through an Integrative Discovery Environment (I-GUIDE) Forum
I-GUIDE Forum -
An international conference for a groundbreaking event focused on Harnessing the Geospatial Data Revolution for Sustainability Solutions.
National Science Foundation (NSF) Harnessing the Data Revolution (HDR) Ecosystem Conference
HDR -
NSF's Harnessing the Data Revolution (HDR) Big Idea is a national-scale activity founded in 2016 to enable new modes of data-driven discovery that address fundamental questions at the frontiers of science and engineering.
Table of Contents
Journal Articles
2024
An Areal Approach to Spatial Accessibility Analysis
Geographical Analysis,
2024
Place-based spatial accessibility quantifies the distribution of access to goods and services across space. The Two-Step Floating Catchment Area (2SFCA) family of methods have become a default tool for spatial accessibility analysis in part due to their intuitive approach and interpretability. This family of methods relies on calculating catchment areas around supply locations to estimate the area and population that may utilize them. However, these “catchment areas” are generally defined by origin-destination matrices of travel-time, giving us point-to-point distances and not polygons with actual area. This means that population geographies (census tracts, blocks, etc.) are binarily included or excluded, with no room for partial inclusion. When using nongranular data, which is often the case due to data privacy restrictions, this has the potential to cause significant errors in accessibility measurements. In this article, we propose Areal 2SFCA: a new approach that considers the area of overlap between travel-time polygons and population geographies. We demonstrate the effectiveness of the Areal 2SFCA method using a case study that compares the Enhanced Two-Step Floating Catchment Area (E2SFCA) and Areal E2SFCA for the state of Illinois in the USA using multiple population granularities.
CyberGIS-Compute: Middleware for democratizing scalable geocomputation
Michels, Alexander C.,
Padmanabhan, Anand,
Xiao, Zimo,
Kotak, Mit,
Baig, Furqan,
and
Wang, Shaowen
SoftwareX,
2024
CyberGIS—geographic information science and systems (GIS) based on advanced cyberinfrastructure—is becoming increasingly important to tackling a variety of socio-environmental problems like climate change, disaster management, and water security. While recent advances in high-performance computing (HPC) have the potential to help address these problems, the technical knowledge required to use HPC has posed challenges to many domain experts. In this paper, we present CyberGIS-Compute: a geospatial middleware tool designed to democratize HPC access for solving diverse socio-environmental problems. CyberGIS-Compute does this by providing a simple user interface in Jupyter, streamlining the process of integrating domain-specific models with HPC, and establishing a suite of APIs friendly to domain experts.
SPASTC: A Spatial Partitioning Algorithm for Scalable Travel-time Computation
International Journal of Geographical Information Science,
2024
Travel-time computation with large transportation networks is often computationally intensive for two main reasons: 1) large computer memory is required to handle large networks; and 2) calculating shortest-distance paths over large networks is computing intensive. Therefore, previous research tends to limit their spatial extent to reduce computational intensity or resolve computational intensity with advanced cyberinfrastructure. In this context, this article describes a new Spatial Partitioning Algorithm for Scalable Travel-time Computation (SPASTC) that is designed based on spatial domain decomposition with computer memory limit explicitly considered. SPASTC preserves spatial relationships required for travel-time computation and respects a user-specified memory limit, which allows efficient and large-scale travel-time computation within the given memory limit. We demonstrate SPASTC by computing spatial accessibility to hospital beds across the conterminous United States. Our case study shows that SPASTC achieves significant efficiency and scalability making the travel-time computation tens of times faster.
2023
EasyScienceGateway: A new framework for providing reproducible user environments on science gateways
Concurrency and Computation: Practice and Experience,
2023
Science gateways have become a core part of the cyberinfrastructure ecosystem by increasing access to computational resources and providing community platforms for sharing and publishing education and research materials. While science gateways represent a promising solution for computational reproducibility, common methods for providing users with their user environments on gateways present challenges which are difficult to overcome. This article presents EasyScienceGateway: a new framework for providing user environments on science gateways to resolve these challenges, provides the technical details on implementing the framework on a science gateway based on Jupyter Notebook, and discusses our experience applying the framework to the CyberGIS-Jupyter and CyberGIS-Jupyter for Water gateways.
Daily Changes in Spatial Accessibility to ICU Beds and Their Relationship with the Case-Fatality Ratio of COVID-19 in the State of Texas, USA
Applied Geography,
2023
During the COVID-19 pandemic, many patients could not receive timely healthcare services due to limited availability and access to healthcare resources and services. Previous studies found that access to intensive care unit (ICU) beds saves lives, but they overlooked the temporal dynamics in the availability of healthcare resources and COVID-19 cases. To fill this gap, our study investigated daily changes in ICU bed accessibility with an enhanced two-step floating catchment area (E2SFCA) method in the state of Texas. Along with the increased temporal granularity of measurements, we uncovered two phenomena: 1) aggravated spatial inequality of access during the pandemic, and 2) the retrospective relationship between insufficient ICU bed accessibility and the high case-fatality ratio of COVID-19 in rural areas. Our findings suggest that those locations should be supplemented with additional healthcare resources to save lives in future pandemic scenarios.
2022
Spatial Accessibility to HIV Testing, Treatment, and Prevention Services in Illinois and Chicago, USA
Kang, Jeon-Young,
Fayaz-Farkhad, Bita,
Chan, Man-pui Sally,
Michels, Alexander,
Albarracin, Dolores,
and
Wang, Shaowen
PLOS ONE,
2022
Accomplishing the goals outlined in “Ending the HIV (Human Immunodeficiency Virus) Epidemic: A Plan for America Initiative” will require properly estimating and increasing access to HIV testing, treatment, and prevention services. In this research, a computational spatial method for estimating access was applied to measure distance to services from all points of a city or state while considering the size of the population in need for services as well as both driving and public transportation. Specifically, this study employed the enhanced two-step floating catchment area (E2SFCA) method to measure spatial accessibility to HIV testing, treatment (i.e., Ryan White HIV/AIDS program), and prevention (i.e., Pre-Exposure Prophylaxis [PrEP]) services. The method considered the spatial location of MSM (Men Who have Sex with Men), PLWH (People Living with HIV), and the general adult population 15–64 depending on what HIV services the U.S. Centers for Disease Control (CDC) recommends for each group. The study delineated service- and population-specific accessibility maps, demonstrating the method’s utility by analyzing data corresponding to the city of Chicago and the state of Illinois. Findings indicated health disparities in the south and the northwest of Chicago and particular areas in Illinois, as well as unique health disparities for public transportation compared to driving. The methodology details and computer code are shared for use in research and public policy.
Particle Swarm Optimization for Calibration in Spatially Explicit Agent-Based Modeling
Journal of Artificial Societies and Social Simulation,
2022
A challenge in computational modeling of Agent-Based Models (ABMs) is the amount of time and resources required to tune a set of parameters for reproducing the observed patterns of phenomena being modeled. Well-tuned parameters are necessary for models to reproduce real-world multi-scale space-time patterns, but calibration is often computationally intensive and time consuming. Particle Swarm Optimization (PSO) is a swarm intelligence optimization algorithm that has found wide use for complex optimization including nonconvex and noisy problems. In this study, we propose to use PSO for calibrating parameters in ABMs. We use a spatially explicit ABM of influenza transmission based in Miami, Florida, USA as a case study. Furthermore, we demonstrate that a standard implementation of PSO can be used out-of-the-box to successfully calibrate models and out-performs Monte Carlo in terms of optimization and efficiency.
2021
An Integrated Framework of Global Sensitivity Analysis and Calibration for Spatially Explicit Agent-Based Models
Transactions in GIS,
2021
Abstract Calibration of agent-based models (ABMs) is a major challenge due to the complex nature of the systems being modeled, the heterogeneous nature of geographical regions, the varying effects of model inputs on the outputs, and computational intensity. Nevertheless, ABMs need to be carefully tuned to achieve the desirable goal of simulating spatiotemporal phenomena of interest, and a well-calibrated model is expected to achieve an improved understanding of the phenomena. To address some of the above challenges, this article proposes an integrated framework of global sensitivity analysis (GSA) and calibration, called GSA-CAL. Specifically, variance-based GSA is applied to identify input parameters with less influence on differences between simulated outputs and observations. By dropping these less influential input parameters in the calibration process, this research reduces the computational intensity of calibration. Since GSA requires many simulation runs, due to ABMs’ stochasticity, we leverage the high-performance computing power provided by the advanced cyberinfrastructure. A spatially explicit ABM of influenza transmission is used as the case study to demonstrate the utility of the framework. Leveraging GSA, we were able to exclude less influential parameters in the model calibration process and demonstrate the importance of revising local settings for an epidemic pattern in an outbreak.
2020
Rapidly Measuring Spatial Accessibility of COVID-19 Healthcare Resources: A Case Study of Illinois, USA
Kang, Jeon-Young,
Michels, Alexander C,
Lyu, Fangzheng,
Wang, Shaohua,
Agbodo, Nelson,
Freeman, Vincent L,
and
Wang, Shaowen
International Journal of Health Geographics,
2020
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), causing the coronavirus disease 2019 (COVID-19) pandemic, has infected millions of people and caused hundreds of thousands of deaths. While COVID-19 has overwhelmed healthcare resources (e.g., healthcare personnel, testing resources, hospital beds, and ventilators) in a number of countries, limited research has been conducted to understand spatial accessibility of such resources. This study fills this gap by rapidly measuring the spatial accessibility of COVID-19 healthcare resources with a particular focus on Illinois, USA. Specifically, the rapid measurement is achieved by resolving computational intensity of an enhanced two-step floating catchment area (E2SFCA) method through a parallel computing strategy based on cyberGIS (cyber geographic information science and systems). The study compared the spatial accessibility measures for COVID-19 patients to those of general population, identifying which geographic areas need additional healthcare resources to improve access. The results also help delineate the areas that may face a COVID-19-induced shortage of healthcare resources caused by COVID-19. The Chicagoland, particularly the southern Chicago, shows an additional need for resources. Our findings are relevant for policymakers and public health practitioners to allocate existing healthcare resources or distribute new resources for maximum access to health services.
Peer-Reviewed Conference Papers
2024
CyberGIS-Vis for Democratizing Access to Scalable Spatiotemporal Geovisual Analytics: A Case Study of COVID-19
Han, Su,
Kim, Joon-Seok,
Jiang, Yuqin,
Kang, Jeon-Young,
Park, Jinwoo,
Han, Chaeyeon,
Michels, Alexander,
and
Wang, Shaowen
5th ACM SIGSPATIAL International Workshop on Spatial Computing for Epidemiology (SpatialEpi’24),
2024
The COVID-19 pandemic underscored the critical need for effective disease mapping tools, essential for tracking infectious diseases. Following the WHO’s pandemic declaration in March 2020, numerous technological solutions emerged to map cases, assess risk factors, and monitor mobility. However, there remains a shortage of reusable, open-source geovisual analytics tool for rapid response to future pandemics. To address this gap, we developed an innovative open-source JavaScript-based geovisual analytics tool as part of the CyberGIS-Vis project. This paper introduces two visualization modules of CyberGIS-Vis, showcasing their use in visualizing spatiotemporal COVID-19 data by integrating advanced cyberGIS and online visualization with robust analytics for geospatial knowledge discovery.
Data-Intensive Convergence Science for Analyzing Place-Based Spatial Accessibility
I-GUIDE Forum 2024,
2024
Place-based spatial accessibility is a critical tool for measuring the health, resilience, and sustainability of communities. Accessibility methods are employed by a wide range of fields to measure access to food, healthcare, infrastructure and other critical needs. While measures of access are relatively simple, they attempt to capture the complexities of human mobility and spatial decision-making to assess how well populations are served by the infrastructure, resources, and services at their disposal. This paper describes four key areas where data-intensive convergence science can revolutionize our understanding of place-based spatial accessibility by addressing issues of scale, spatial impedance, diversity, and accessibility. By tackling these key issues, we can create measures of access that are more detailed, accurate, inclusive, and approachable, making place-based spatial accessibility a better diagnostic tool as we work towards more sustainable places.
Understanding Complex Socio-Environmental Systems with Spatial Agent-Based Models
I-GUIDE Forum 2024,
2024
Our increasingly connected world is faced with complex socio-environmental problems (e.g., biodiversity loss, climate change, and food insecurity). Tackling these problems requires cross- disciplinary approaches that examine the problems based on synergistic spatial and system thinking. Spatial Agent-Based Models (SABMs) represent a powerful approach to understanding complex socio-environmental systems. However, research on SABMs and associated complex problem solving face grand challenges that must be overcome to effectively unleash the power of SABMs enabled by cyber-based geographic information science and systems (cyberGIS). This paper describes four such grand challenges —reproducibility, scalability, communication, and accessibility. Resolving these challenges will enable new spatial computing frontiers to model complex socio-environmental systems at unprecedented spatiotemporal scales for tackling associated real-world problems.
Building Blocks for Geospatial Software Education Using the I-GUIDE Platform
I-GUIDE Forum 2024,
2024
By combining advanced cyberinfrastructure with geospatial analysis capabilities and resources in an accessible online environment, the I-GUIDE Platform has great potential for geospatial computing focused education. However, learning occurs in different settings and contexts, both formal and informal. For I-GUIDE Platform to be successful, it should have the flexibility to support a variety of educational needs. In this paper, we argue for an expanded set of front-end building blocks to support diverse education and research use-cases, building on existing cyberGIS capabilities and Jupyter backend. We draw from experience working with the CyberGISX platform as an education tool in different learning contexts to suggest a series of front-end building blocks to best leverage the powerful combination of cyberinfrastructure and geospatial resources for flexible and adaptable educational needs.
Providing Accessible Software Environments Across Science Gateways and HPC
Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
2024
While High-Performance Computing (HPC) resources are powerful for tackling complex, computationally intensive analysis and modeling problems, access to these resources varies across disciplines. Domain scientists in a variety of fields such as social and environmental sciences often lack in-depth technical skills (e.g., familiarity with terminal, knowledge of job schedulers) to effectively utilize HPC resources, hindering desired research. In this context, CyberGIS-Compute is a middleware toolkit designed to democratize HPC access with the main goal of enabling domain scientists in diverse fields to solve computationally intensive problems. A key challenge facing model developers on CyberGIS-Compute is to create a containerized software environment for their models. Domain experts unfamiliar with HPC are generally unfamiliar with containerization technologies (e.g., Docker, Singularity) and thus unable to create/test containers to execute their models. But if they have access to science gateways, they would want to use these familiar software environments on HPC resources. This paper describes a novel approach to integrating the Cern Virtual Machine File System (CVMFS) into CyberGIS-Compute to provide consistent software environments across science gateways and HPC resources.
2023
An Agent-Based Modeling Approach to Spatial Accessibility
Forum 2023 - Harnessing the Geospatial Data Revolution for Sustainability Solutions,
2023
Place-based spatial accessibility represents the ability of populations within geographic units to access goods and services, and thus is an important indicator for sustainable development. Existing spatial accessibility models treat population as simply demand, calculating statistics or optimizing average cost for the population within each geographic unit, rather than modeling individual decisions. This paper proposes AgentAccess, a general-purpose Agent-Based Model (ABM) for spatial accessibility analysis. An ABM framework brings us closer to reality by simulating individual and imperfect decision-making. We introduce the model and compare its results against existing spatial accessibility models using a case study of hospital beds in Cook County, IL, USA.
Streamlined HPC Environments with CVMFS and CyberGIS-Compute
Forum 2023 - Harnessing the Geospatial Data Revolution for Sustainability Solutions,
2023
High-Performance Computing (HPC) resources provide the potential for complex, large-scale modeling and analysis, fueling scientific progress over the last few decades, but these advances are not equally distributed across disciplines. Those in computational disciplines are often trained to have the necessary technical skills to utilize HPC (e.g. familiarity with the terminal), but many disciplines face technical hurdles when trying to apply HPC resources to their work. This unequal familiarity with HPC is increasingly a problem as cross-discipline teams work to tackle critical interdisciplinary issues like climate change and sustainability. CyberGIS-Compute is middle-ware designed to democratize to HPC services with the goal of empowering domain scientists, but a key challenge facing model developers on CyberGIS-Compute is creating a containerized software environment for their models. In this paper, we discuss our work to integrate the Cern Virtual Machine File System (CVMFS) into CyberGIS-Compute to provide consistent software environments across science gateways and HPC resources.
I-GUIDE Climbers: A Model for Multidisciplinary Academic Labs for Early Career Development
Haqiqi, Iman,
Hu, Wei,
Kumaran, Ramya,
Li, Pin-Ching,
Manning, Nicholas,
Michels, Alexander,
Nassar, Ayman,
Park, Jinwoo,
Shi, Jimeng,
Tonks, Adam,
and
Wang, Zhaonan
Forum 2023 - Harnessing the Geospatial Data Revolution for Sustainability Solutions,
2023
In this paper, we propose a new form of multidisciplinary academic collaboration that goes beyond the traditional modes of knowledge exchange. We argue that most research collaboration today is based on interactions between closely related disciplines, in which researchers share data, methods, and insights within a common framework or problem. However, such collaboration may not foster the development of the communication and management skills essential to a multi-disciplinary research career. Therefore, we suggest establishing a network of researchers from divergent, yet complementary, disciplines who are interested in improving these skills through regular interactions and feedback. The main goal of this network is not to conduct research or address a specific research question, but to create a learning environment where researchers can enhance their interdisciplinary competencies through the diverse perspectives and experiences of their peers. Moreover, a multidisciplinary group of early-career professionals provides a space for collaborations to flourish. In this paper, we also offer practical advice for researchers who wish to join or create a similar network.
Impacts of Catchments Derived from Fine-Grained Mobility Data on Spatial Accessibility
12th International Conference on Geographic Information Science (GIScience 2023),
2023
Spatial accessibility is a powerful tool for understanding how access to important services and resources varies across space. While spatial accessibility methods traditionally rely on origin-destination matrices between centroids of administrative zones, recent work has examined creating polygonal catchments - areas within a travel-time threshold - from point-based fine-grained mobility data. In this paper, we investigate the difference between the convex hull and alpha shape algorithms for determining catchment areas and how this affects the results of spatial accessibility analyses. Our analysis shows that the choice of how we define a catchment produces differences in the measured accessibility which correlate with social vulnerability. These findings highlight the importance of evaluating and communicating minor methodological choices in spatial accessibility analyses.
2022
CyberGIS-Cloud: A Unified Middleware Framework for Cloud-Based Geospatial Research and Education
Baig, Furqan,
Michels, Alexander,
Xiao, Zimo,
Han, Su Yeon,
Padmanabhan, Anand,
Li, Zhiyu,
and
Wang, Shaowen
Practice and Experience in Advanced Research Computing,
2022
Interest in cloud-based cyberinfrastructure continues to grow within the geospatial community to tackle contemporary big data challenges. Distributed computing frameworks, deployed over the cloud, provide scalable and low-maintenance solutions to accelerate geospatial research and education. However, for scientists and researchers, the usage of such resources is highly constrained by the steep curve for learning diverse sets of platform-specific tools and APIs. This paper presents CyberGIS-Cloud as a unified middleware to streamline the execution of distributed geospatial workflows over multiple cloud backends with easy-to-use interfaces. CyberGIS-Cloud employs bringing computation-to-data model by abstracting and automating job execution over distributed resources hosted in the cloud environment where the data resides. We present details of CyberGIS-Cloud with support for popular distributed computing frameworks backed by research-oriented JetStream Cloud and commercial Google Cloud Platform.
2020
An Exploration of the Effect of Buyer Preference and Market Composition on the Rent Gradient Using the ALMA Framework
Proceedings of the 3rd ACM SIGSPATIAL International Workshop on GeoSpatial Simulation,
2020
Urban land markets exhibit complex emergent behaviors that have yet to be fully explained by the microeconomic decision-making which constitutes the market. The Agent-based Land MArket (ALMA) framework has been introduced to simulate a bilateral agent-based land market that produces a rent gradient. In this paper, we extend the ALMA framework by introducing two new parameters, heterogeneity, and stochasticity which allow us to explore how the rent gradient is affected by buyers with diverse preferences and a range of market compositions.
2019
CyberGIS-Jupyter for Spatially Explicit Agent-based Modeling: A Case Study on Influenza Transmission
GeoSim ’19: Proceedings of the 2nd ACM SIGSPATIAL International Workshop on GeoSpatial Simulation,
2019
Despite extensive efforts on achieving reproducible agent-based models (ABMs) to improve the capability of this widely adopted methodology, it remains challenging to reproduce and replicate pre-existing ABMs, due to a number of factors such as diverse computing resources and ABMs platforms. In this study, we propose to employ CyberGIS-Jupyter for spatially explicit ABMs. CyberGIS-Jupyter is a cyberGIS framework to achieve data-intensive, reproducible, and scalable geospatial analytics using Jupyter Notebook based on advanced cyberinfrastructure. Influenza transmission in the city of Miami, Florida, USA was used as a case study. In the model, Influenza is transmitted through the contact networks of individual human agents, which are constructed based on commuting behaviors. CyberGIS-Jupyter can support one not only to conduct collaborative and transparent modeling, but also to perform modeling simulation on advanced cyberinfrastructure resources. It may contribute to boosting the reproducibility and replicability of ABMs.
Published Abstracts
2022
CyberGIS-Jupyter for Water - An Open Geospatial Computing Platform for Collaborative Water Research
Li, Zhiyu,
Michels, Alexander,
Padmanabhan, Anand,
Nassar, Ayman,
Tarboton, David G.,
and
Wang, Shaowen
AGU Fall Meeting Abstracts,
2022
Recent advances in cyberinfrastructure and data science promise to transform how hydrologic analysis and modeling are conducted. However, the computational capabilities needed for this potential transformation still remain only accessible to a small set of domain experts, hampering the engagement and contribution from the broader water research community. We have developed a domain-specific online computing platform, called CyberGIS-Jupyter for Water (CJW), that aims to integrate advanced cyberinfrastructure and geospatial capabilities for serving the broad water science communities. CJW represents a novel cyber-based geospatial information science and systems (cyberGIS) framework for harnessing distributed high-performance computing resources to enable collaborative and large-scale hydrologic analysis and modeling. CJW provides a stack of integrated geospatial software tools and libraries to facilitate collaborative and reproducible workflows that have been made interoperable with HydroShare, a web-based hydrologic data and model sharing platform, to expand community access. This talk presents the design and implementation of CJW, and demonstrates its capabilities with several success stories from users and a case study on computationally intensive hydrologic modeling based on WRF-Hydro.
2021
CyberGIS-Compute for Enabling Computationally Intensive Geospatial Research (Ext. Abs.)
Padmanabhan, Anand,
Xiao, Zimo,
Vandewalle, Rebecca,
Baig, Furqan,
Michels, Alexander,
Li, Zhiyu,
and
Wang, Shaowen
SpatialAPI’21: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on APIs and Libraries for Geospatial Data Science,
2021
In this tutorial, we will first start with the basics of CyberGISJupyter and CyberGIS-Compute, then introduce the Python SDK for CyberGIS-Compute with a simple Hello World example. Then, we will take multiple real-world geospatial applications use-cases like spatial accessibility and wildfire evacuation simulation using agent based modeling. We will also provide pointers on how to contribute applications to the CyberGIS-Compute framework.
Towards Reproducible Research on CyberGISX with Lmod and Easybuild (Ext. Abs.)
Proceedings of Gateways 2021,
2021
JupyterHub [1] has become a popular choice in many scientific communities, offering an easy-to-use interface for users with little to no frontend development work while promoting reproducible and replicable (R&R) science [2]. In the broad geospatial science community, CyberGISX [3] provides such a gateway environment with many cyberGIS (i.e., geospatial information science and systems based on advanced cyberinfrastructure) and geospatial software packages prebuilt and ready to use. Like other JupyterHub-based solutions, CyberGISX also provides container-based access for its users and must balance a trade-off between providing a static compute environment which enhances R&R and continuously updating the software environment to keep up with advances in scientific software. Solutions such as Binder [4] have attempted to address this trade-off by having required dependencies encoded in the package and building the software environment at the time of use. However, such a solution comes with two major disadvantages: (a) software is built at the time it is needed, increasing startup time and introducing the possibility that some of the dependencies of the environment are no longer available or have changed; and (b) the onus of specifying and managing software installations is passed to notebook developers, many of whom are domain scientists and not comfortable with such responsibilities. To address these challenges and enhance R&R with minimal effort from end-users, we have designed and implemented a solution on CyberGISX that allows software to be kept on an external file server mounted into each user’s environment. Scientific software is installed with Easybuild [5] and managed by Lmod [6] giving a variety of benefits: (1) the compute environment is more standardized and easily reproducible outside of the gateway; (2) multiple versions of software can be made available to users without increasing container size; and (3) the exact copies of software are always available on the gateway instead of being rebuilt for every release, further enhancing R&R. We also employ an Easybuild-installed Anaconda [7] to create and manage conda environments on the file server. The combination of the software stack from Easybuild and Python environment from conda provides end-users with kernels for their Jupyter notebooks which are persistent and unchanged as the gateway’s container updates. This design enhances R&R and adds functionality for advanced users without introducing technical barriers to non-technical end-users. As such, domain scientists using this solution need not build their own software and specify dependencies, which helps prevent the notebooks they have developed from getting broken by the next software release. This talk explores the new architecture and applications of this solution to CyberGISX [3] and CyberGIS-Jupyter for Water (CJW) [8].
Enabling Computationally Intensive Geospatial Research on CyberGIS-Jupyter with CyberGIS-Compute (Ext. Abs.)
Proceedings of Gateways 2021,
2021
Geospatial research and education have become increasingly dependent on cyberGIS, defined as geographic information science and systems based on advanced cyberinfrastructure (CI), [1] to tackle computation and data challenges. However, the use of advanced cyberGIS capabilities has typically been constrained to a small set of research groups who have the technical expertise of using CI resources. Over the past few years CyberGIS-Jupyter [2,3] has been developed to provide access to cyberGIS capabilities through an easy-to-use Jupyter Notebook interface which has made cyberGIS more accessible. For many cyberGIS and geospatial applications accessing CI resources needed for solving complex problems at scale. However, leveraging CI resources for geospatial application is challenging both due to the steep learning curve and lack of appropriate tools. CyberGIS-Compute fills this gap by providing an easy-to-use middleware tool for using and contributing geospatial application codes that leverage CI resources. This substantially lowers the learning curve for both geospatial users and developers to access cyberGIS capabilities at scale. CyberGIS-Compute is backed by Virtual ROGER (Resourcing Open Geospatial Education and Research); a geospatial supercomputer with access to a number of readily available popular geospatial libraries.
With CyberGIS-Compute we have designed an easy-to-use middleware and associated Python SDK to provide access to CyberGIS capabilities, allowing geospatial applications to easily scale and employ advanced cyberinfrastructure resources. This presentation will first describe the basics of CyberGIS-Jupyter and CyberGIS-Compute, then introduce the Python SDK for CyberGIS-Compute with a simple example. Then, we will take multiple real-world geospatial applications use-cases like spatial accessibility and wildfire evacuation simulation using agent based modeling. Lastly, we will also descrive mechanism to contribute applications to the CyberGIS-Compute framework.