Skip to content

ZekaiJ/DRL4CONBOTS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DRL4CONBOTS logo

Deep Reinforcement Learning for Construction Robotics

A System-Level Taxonomy and Evidence Map toward Real-World Readiness

Zekai Jin1, Huiguang Wang1, Yihong Tang1, Zhen Dong2,3, Chen Feng4, and Yi Shao1,5,*
1Department of Civil Engineering, McGill University, Montreal, QC, Canada
2Department of Computer Science, University of California, Santa Barbara, Santa Barbara, CA, USA
3NVIDIA Corporation, Santa Clara, CA, USA
4Department of Mechanical and Aerospace Engineering, Tandon School of Engineering, New York University, NY, USA
5Department of Civil Engineering, University of British Columbia, Vancouver, Canada
*Corresponding author: yi.shao@ubc.ca

Automation in Construction Under review status 152 coded instances 75 primary DRL instances A1-A5 framework MIT License Stars Badge

DRL4CONBOTS Evidence Atlas
Preview interactive atlas | Browse static page source | Copy citation metadata

A curated companion repository for Deep Reinforcement Learning for Construction Robotics (DRL4CONBOTS).

This repository accompanies the manuscript under review in Automation in Construction, "Deep Reinforcement Learning for Construction Robotics: A System-Level Taxonomy and Evidence Map toward Real-World Readiness".

DRL-enabled construction robotics is moving from isolated task demonstrations toward system-level questions about authority, runtime assurance, validation exposure, and deployment-relevant evidence. This repository tracks representative papers, taxonomy figures, and evidence patterns for researchers, practitioners, and students working at the intersection of construction automation, robot learning, and field robotics.

Contact: zekai.jin@mail.mcgill.ca

At a glance

Item Current release
Manuscript status Under review in Automation in Construction
Evidence base 152 coded construction-robotics instances
Primary DRL synthesis 75 primary DRL instances across five competency regimes
Framework A1-A5 competency, learning formulation, runtime authority, runtime assurance, and validation exposure
Core diagnostic 35/75 primary instances are AM0 + EVL L0, while 0 reach sustained workflow-integrated deployment

Interactive Evidence Atlas

The repository includes a GitHub Pages-ready interactive atlas in docs/index.html. The atlas provides a high-level entry point for readers who want to scan the framework, inspect readiness diagnostics, filter representative papers, and browse core figures before reading the manuscript.

Preview the interactive atlas

Graphical overview of the DRL4CONBOTS evidence framework for deployment-relevant DRL in construction robotics.

Table of Contents

News and Updates

  • [June 2026] Repository launch: DRL4CONBOTS is prepared as the public companion repository for the manuscript under review in Automation in Construction.
  • [June 2026] Evidence map released: The current repository tracks 152 coded construction-robotics instances, including 75 primary DRL instances.
  • [June 2026] Core finding: Across the 75 primary DRL instances, 35 are simultaneously AM0 and EVL L0, while 0 reach sustained workflow-integrated deployment.

Corpus and Screening

The review uses a report-level PRISMA screening process before coding system-instance evidence for taxonomy and readiness analysis.

PRISMA flow from database search to the final coded evidence base.

Framework and Taxonomy

DRL4CONBOTS is a system-level evidence map for deep reinforcement learning in construction robotics. Instead of ranking algorithms by task success alone, it interprets what each reported robotic system can credibly claim based on its task regime, learning formulation, runtime authority, runtime assurance, and validation exposure.

The review uses five coupled dimensions:

  • A1. Competency regime: earthwork, assembly, lifting, additive/surface processing, or navigation/logistics.
  • A2. Learning formulation: observation interface, action abstraction, decision formalism, objective specification, and training pathway.
  • A3. Runtime authority: who senses, proposes, arbitrates, and executes robot behavior.
  • A4. Runtime assurance: whether execution-time safeguards, constraints, monitors, or fallback layers are disclosed.
  • A5. Validation exposure: where behavior is tested, from simulation-only evaluation to sustained field deployment.

Five construction-robotics competency regimes used to organize task-specific evidence and deployment bottlenecks.

Methodology Figures

Methodological translation logic: reported task outcomes are interpreted as bounded deployment-relevant claim scope through the five-axis framework.

Evidence Diagnostic

A cross-axis view of runtime-assurance disclosure and validation exposure across competency regimes.

Runtime assurance (A4) against validation exposure (A5), with validation-exposure composition by competency regime.

Representative Papers by Category

Selected papers are grouped by the five competency regimes used in the DRL4CONBOTS framework.

Earthwork and Material Processing - 36 primary instances

Earthwork and Material Processing

Function: Learn excavation, grading, loading, material handling, and machine-level interaction where soil, rock, traction, tool load, and material-state evolution dominate claim scope.

Structural Assembly and Installation - 18 primary instances

Structural Assembly and Installation

Function: Learn tolerance-sensitive placement, insertion, joining, tactile correction, and installation where small pose errors can become jamming, wedging, surface damage, or contact-mode failures.

Material Placement and Lifting - 8 primary instances

Material Placement and Lifting

Function: Learn crane, hoist, lift-planning, and suspended-payload behavior where delayed oscillatory dynamics, swept-volume risk, payload variability, and site governance shape readiness claims.

Additive Manufacturing and Surface Processing - 2 primary instances

Additive Manufacturing and Surface Processing

Function: Learn process control and path planning where material rheology, tool wear, cumulative geometry deviation, and irreversible process defects shape the validation burden.

Navigation, Layout, and Logistics Support - 11 primary instances

Navigation, Layout, and Logistics Support

Function: Learn mobility, routing, worker-aware planning, layout support, and logistics decisions where human co-presence, congestion, occlusion, handover, and governance determine claim scope.

Evidence Trends

The current evidence base shows that real-world readiness cannot be inferred from algorithm labels, hardware presence, or field-like demonstrations alone.

Evidence-closure map for deployment-relevant claims in construction robotics.

Key findings from the primary synthesis set:

  • Authority-assurance decoupling: 45/75 primary instances are AM0, and 60/75 remain within AM0-AM1.
  • Simulation without safeguards: 35/75 primary instances are simultaneously AM0 and EVL L0.
  • Exposure-assurance divergence: 20/75 primary instances reach EVL L3 or higher, but only 1 reaches EVL L4 and 0 reach EVL L5.

Primary Evidence Regimes

Regime Primary instances Main bottleneck
Earthwork and Material Processing 36/75 Long-horizon exposure to soil variability, muck, resistance discontinuities, and intervention histories
Structural Assembly and Installation 18/75 Tolerance variation, contact-rich disturbances, access constraints, and intervention reporting
Material Placement and Lifting 8/75 Workflow-coupled lift validation, swing handling, disturbance coverage, and governance
Additive Manufacturing and Surface Processing 2/75 Scale-dependent process physics, material rheology, and end-to-end process assurance
Navigation, Layout, and Logistics Support 11/75 Mixed-traffic exposure, handover, communication stress, recovery, and site governance

Citation

The manuscript is currently under review in Automation in Construction. Citation metadata will be updated after preprint, acceptance, or journal publication.

@misc{jin2026drl4conbots,
  title        = {Deep Reinforcement Learning for Construction Robotics: A System-Level Taxonomy and Evidence Map toward Real-World Readiness},
  author       = {Jin, Zekai and Wang, Huiguang and Tang, Yihong and Dong, Zhen and Feng, Chen and Shao, Yi},
  year         = {2026},
  howpublished = {Companion repository for a manuscript under review in Automation in Construction},
  url          = {https://github.com/ZekaiJ/DRL4CONBOTS}
}

License & contact

Released under the MIT license. Questions, feedback, or collaboration: zekai.jin@mail.mcgill.ca

About

DRL4CONBOTS companion repository for deep reinforcement learning in construction robotics.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors