Multi-Task Regression-based Learning for Autonomous Unmanned Aerial Vehicle Flight Control within Unstructured Outdoor Environments

Amir Atapour-Abarghouei; Bruna G. Maciel-Pearson; Christopher Holder; Samet Akcay; Toby P. Breckon

arxiv: 1907.08320 · v1 · pith:KJBFAVOBnew · submitted 2019-07-18 · 💻 cs.RO · cs.CV· cs.LG· cs.SY· eess.SY

Multi-Task Regression-based Learning for Autonomous Unmanned Aerial Vehicle Flight Control within Unstructured Outdoor Environments

Bruna G. Maciel-Pearson , Samet Akcay , Amir Atapour-Abarghouei , Christopher Holder , Toby P. Breckon This is my paper

Pith reviewed 2026-05-24 19:27 UTC · model grok-4.3

classification 💻 cs.RO cs.CVcs.LGcs.SYeess.SY

keywords UAV navigationautonomous flightmulti-task learningregressionforest environmentsend-to-end learningGPS-denied navigationsimulation training

0 comments

The pith

Multi-task regression learning generates flight commands for UAVs to navigate and explore forests using only camera images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces an end-to-end multi-task regression approach that produces navigation and exploration commands for UAVs flying under forest canopy. Traditional methods rely on structured features like trails or external sensors such as GPS, which are often unavailable in dense outdoor settings. The method trains in simulation to map raw images directly to control outputs across multiple tasks at once. Experiments indicate it achieves dense coverage inside search areas, reaches broader regions, works in new environments, and exceeds existing pose estimation methods.

Core claim

The end-to-end multi-task regression-based learning approach defines flight commands for navigation and exploration under the forest canopy regardless of trails or GPS, with experiments showing dense exploration within required perimeters, wider search coverage, generalization to unseen environments, and better results than state-of-the-art techniques in software-in-the-loop testing.

What carries the argument

End-to-end multi-task regression network that maps single images to multiple simultaneous flight control outputs.

If this is right

Enables dense exploration inside designated search perimeters without external positioning aids.
Supports coverage of wider regions than single-task or pose-estimation baselines.
Transfers to previously unseen forest environments without retraining.
Outperforms current state-of-the-art techniques in simulation-based evaluations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same image-to-command mapping could apply to other GPS-denied settings such as urban canyons or indoor spaces.
Real deployment would require explicit validation that simulator image statistics match actual forest lighting and motion.
Adding detection tasks to the multi-task output could turn the same network into a combined navigator and searcher.

Load-bearing premise

The software simulator produces flight dynamics and camera images close enough to real forests that simulation performance carries over to physical UAVs.

What would settle it

A physical UAV test flight in an actual forest where the learned model loses stable control despite working in simulation.

Figures

Figures reproduced from arXiv: 1907.08320 by Amir Atapour-Abarghouei, Bruna G. Maciel-Pearson, Christopher Holder, Samet Akcay, Toby P. Breckon.

**Figure 2.** Figure 2: The proposed Multi-Task Regression-based Learning approach. The network predicts 3 positional ( [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: 2D representation of flights using the test set. Left image shows the [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Illustration of the angular rotation of the UAV when the values in the [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 5.** Figure 5: Comparison of each approach when autonomously flying under the canopy of a dense forest, over a snowy mountain and over a plain field. [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

**Figure 6.** Figure 6: Heat and activation maps from the three last convolutional layers [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: Three last convolutional layers of Bojarski [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 8.** Figure 8: Three last inception layers of Kendall et al. [12] approach [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

**Figure 9.** Figure 9: Three last convolutional layers of Wang et al. [13] approach. 7-9, the deeper the network, the more specific is the knowledge acquired about the training environment. This phenomenon is mainly attributed to the fact later layers tend not only to retain spatial information but also to learn high-level semantic information about the scene [38], [39], as is the case for the approaches by Kendall et al. [12] (… view at source ↗

read the original abstract

Increased growth in the global Unmanned Aerial Vehicles (UAV) (drone) industry has expanded possibilities for fully autonomous UAV applications. A particular application which has in part motivated this research is the use of UAV in wide area search and surveillance operations in unstructured outdoor environments. The critical issue with such environments is the lack of structured features that could aid in autonomous flight, such as road lines or paths. In this paper, we propose an End-to-End Multi-Task Regression-based Learning approach capable of defining flight commands for navigation and exploration under the forest canopy, regardless of the presence of trails or additional sensors (i.e. GPS). Training and testing are performed using a software in the loop pipeline which allows for a detailed evaluation against state-of-the-art pose estimation techniques. Our extensive experiments demonstrate that our approach excels in performing dense exploration within the required search perimeter, is capable of covering wider search regions, generalises to previously unseen and unexplored environments and outperforms contemporary state-of-the-art techniques.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper trains a multi-task regression model for vision-only UAV navigation under forest canopy and reports good simulation results, but provides no real-world flights or sim-to-real checks.

read the letter

The main point here is a simulation-only study of an end-to-end multi-task regressor that maps images to flight commands for trail-less forest flight. The work applies this architecture to a search-and-surveillance scenario and tests it inside a software-in-the-loop pipeline against pose-estimation baselines. Within the simulator the model appears to achieve denser coverage, wider search areas, and some generalization to unseen patches while beating the compared methods. That is the concrete result on offer. The multi-task setup is a reasonable choice for this kind of control problem because the shared representation can help with the noisy, texture-poor images you get under canopy. The simulation experiments are at least structured enough to measure exploration metrics and hold-out generalization, which is more than many early learning papers manage. The obvious limitation is the total lack of physical UAV data. No real flights, no wind or lighting variation tests, and no quantification of how well the simulator matches actual forest dynamics. Claims about unstructured outdoor environments therefore rest on an unverified transfer assumption. Without that evidence the practical takeaway stays narrow. This paper is mainly for people already working on learning-based UAV control in natural terrain who want to see one more data point on multi-task regression in simulation. Readers who need hardware results or strong sim-to-real arguments will not find much here. I would send it to peer review so the simulation design and baseline comparisons can be examined properly, but any positive decision would need clear language on the missing real-world step.

Referee Report

2 major / 0 minor

Summary. The paper proposes an end-to-end multi-task regression-based learning method to generate flight commands for autonomous UAV navigation and exploration under forest canopy in unstructured outdoor environments, without relying on trails, paths, or GPS. Training and testing occur exclusively in a software-in-the-loop simulator, with claims that the approach achieves dense exploration within search perimeters, covers wider regions, generalizes to unseen environments, and outperforms contemporary state-of-the-art pose estimation techniques.

Significance. If the simulation results transfer to physical UAVs, the work could contribute to learning-based control for GPS-denied forest operations. The multi-task regression formulation for simultaneous navigation and exploration tasks is a reasonable empirical approach in robotics, but the absence of any real-world validation or sim-to-real analysis substantially limits the immediate significance for practical autonomous UAV applications.

major comments (2)

[Abstract] Abstract: the central claims of outperformance, generalization to unseen environments, and superior dense exploration are presented without any quantitative metrics, error bars, dataset sizes, ablation studies, or statistical details, preventing verification of the performance assertions from the given text.
[Abstract] Abstract and evaluation description: all reported results rely solely on software-in-the-loop simulation with no physical UAV flights, no domain-randomization experiments, and no quantification of simulator fidelity to real forest conditions (e.g., canopy dynamics, lighting, or wind); this directly undermines the applicability of the learned commands to actual unstructured outdoor UAV flight.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the detailed feedback. We address each major comment below, indicating planned revisions where they strengthen the manuscript without misrepresenting the simulation-based scope of the work.

read point-by-point responses

Referee: [Abstract] Abstract: the central claims of outperformance, generalization to unseen environments, and superior dense exploration are presented without any quantitative metrics, error bars, dataset sizes, ablation studies, or statistical details, preventing verification of the performance assertions from the given text.

Authors: We agree that the abstract would benefit from quantitative support. The body of the manuscript contains the requested details from extensive simulation experiments (including metrics on coverage, generalization success, comparisons to pose-estimation baselines, and ablation studies). We will revise the abstract to include key quantitative results, error bars where applicable, and references to dataset sizes and statistical details to make the claims verifiable from the abstract alone. revision: yes
Referee: [Abstract] Abstract and evaluation description: all reported results rely solely on software-in-the-loop simulation with no physical UAV flights, no domain-randomization experiments, and no quantification of simulator fidelity to real forest conditions (e.g., canopy dynamics, lighting, or wind); this directly undermines the applicability of the learned commands to actual unstructured outdoor UAV flight.

Authors: The manuscript states upfront that all training and testing use a software-in-the-loop simulator, and all performance claims are confined to that setting. This choice enables repeatable, controlled comparisons against baselines that would be difficult in the field. We will add an expanded limitations section discussing simulator assumptions, the lack of domain randomization, and the absence of real-world flights or fidelity quantification. The contribution is the multi-task regression formulation and its simulation evaluation; we do not claim direct transfer to physical UAVs. revision: partial

standing simulated objections not resolved

The absence of physical UAV flights, domain-randomization experiments, and quantitative simulator fidelity analysis cannot be addressed by new data collection within the scope of a revision, as these were not part of the original simulation study.

Circularity Check

0 steps flagged

No circularity: empirical ML training and simulation evaluation with no derivations or self-referential predictions

full rationale

The paper describes an end-to-end multi-task regression model trained and tested inside a software-in-the-loop simulator for UAV navigation. No equations, first-principles derivations, or claimed predictions are present that could reduce to fitted parameters by construction. All performance claims (dense exploration, generalization, outperforming SOTA) rest on direct empirical comparison within the same simulator pipeline. No self-citation load-bearing uniqueness theorems, ansatz smuggling, or renaming of known results occur. This is a standard empirical study; the absence of any derivation chain means no circular reduction is possible.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that simulation images and dynamics are representative, plus standard supervised-learning assumptions that the collected training trajectories cover the distribution of forest scenes encountered at test time. No new physical entities or ad-hoc constants are introduced beyond typical neural-network hyperparameters.

free parameters (1)

network weights
Learned from simulation trajectories; central claim depends on these fitted values generalizing.

axioms (1)

domain assumption Simulation-to-reality gap is small enough for performance transfer
Invoked when authors extrapolate simulation results to real UAV flight capability.

pith-pipeline@v0.9.0 · 5746 in / 1164 out tokens · 16613 ms · 2026-05-24T19:27:03.134307+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

End-to-End Multi-Task Regression-based Learning approach capable of defining flight commands for navigation and exploration under the forest canopy... Training and testing are performed using a software in the loop pipeline
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Our network is based on a Multi-Task Regression-based Learning (MTRL) approach... cost function θ minimises the Euclidean distance

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages · 6 internal anchors

[1]

A survey of unmanned aerial vehicle (UA V) usage for imagery collection in disaster research and manage- ment,

S. M. Adams and C. J. Friedland, “A survey of unmanned aerial vehicle (UA V) usage for imagery collection in disaster research and manage- ment,” in Int. Workshop on Remote Sensing for Disaster Response , 2011, p. 8. 1

work page 2011
[2]

Adapting open-source drone autopilots for real-time iceberg observations,

D. F. Carlson and S. Rysgaard, “Adapting open-source drone autopilots for real-time iceberg observations,” MethodsX, vol. 5, pp. 1059–1072,

work page
[3]

The potential use of unmanned aircraft systems (drones) in mountain search and rescue operations,

Y . Karaca, M. Cicek, O. Tatli, A. Sahin, S. Pasli, M. F. Beser, and S. Turedi, “The potential use of unmanned aircraft systems (drones) in mountain search and rescue operations,” American J. Emergency Medicine, vol. 36, no. 4, pp. 583–588, 2018. 1

work page 2018
[4]

Forestry applications of UA Vs in Europe: A review,

C. Torresan, A. Berton, F. Carotenuto, S. Filippo Di Gennaro, B. Gioli, A. Matese, F. Miglietta, C. Vagnoli, A. Zaldei, and L. Wallace, “Forestry applications of UA Vs in Europe: A review,” International Journal of Remote Sensing , no. 38, pp. 2427–2447, 2017. 1, 2

work page 2017
[5]

Y . B. Sebbane, Intelligent Autonomy of UA Vs: Advanced Missions and Future Use. Chapman and Hall/CRC, 2018. 1

work page 2018
[6]

Survey on computer vision for UA Vs: Current developments and trends,

C. Kanellakis and G. Nikolakopoulos, “Survey on computer vision for UA Vs: Current developments and trends,” J. Intelligent & Robotic Systems, vol. 87, no. 1, pp. 141–168, 2017. 1, 2

work page 2017
[7]

An architecture for robust UA V navigation in GPS-denied areas,

F. J. Perez-Grau, R. Ragel, F. Caballero, A. Viguria, and A. Ollero, “An architecture for robust UA V navigation in GPS-denied areas,” J. Field Robotics, vol. 35, no. 1, pp. 121–145, 2018. 1, 2

work page 2018
[8]

Simulation tools, environments and frameworks for UA V systems performance analysis,

A. I. Hentati, L. Krichen, M. Fourati, and L. C. Fourati, “Simulation tools, environments and frameworks for UA V systems performance analysis,” in Int. Conf. Wireless Communications & Mobile Computing . IEEE, 2018, pp. 1495–1500. 1

work page 2018
[9]

AirSim: High-ﬁdelity visual and physical simulation for autonomous vehicles,

S. Shah, D. Dey, C. Lovett, and A. Kapoor, “AirSim: High-ﬁdelity visual and physical simulation for autonomous vehicles,” in Field and Service Robotics, 2017, pp. 621–635. 1

work page 2017
[10]

Airsim: High-ﬁdelity visual and physical simulation for au- tonomous vehicles,

——, “Airsim: High-ﬁdelity visual and physical simulation for au- tonomous vehicles,” in Field and Service Robotics . Springer, 2018, pp. 621–635. 1, 2

work page 2018
[11]

FlyMASTER: Multi-UA V control and supervision with ROS,

A. P. Lamping, J. N. Ouwerkerk, N. O. Stockton, K. Cohen, M. Kumar, and D. W. Casbeer, “FlyMASTER: Multi-UA V control and supervision with ROS,” in Aviation Technology, Integration, and Operations Con- ference, 2018. 1

work page 2018
[12]

Posenet: A convolutional network for real-time 6-DOF camera relocalization,

A. Kendall, M. Grimes, and R. Cipolla, “Posenet: A convolutional network for real-time 6-DOF camera relocalization,” in Int. Conf. Computer Vision, 2015, pp. 2938–2946. 2, 3, 4, 5, 6, 7

work page 2015
[13]

DeepVO: Towards end-to- end visual odometry with deep recurrent convolutional neural networks,

S. Wang, R. Clark, H. Wen, and N. Trigoni, “DeepVO: Towards end-to- end visual odometry with deep recurrent convolutional neural networks,” in Int. Conf. Robotics and Automation . IEEE, 2017, pp. 2043–2050. 2, 3, 4, 5, 6, 7

work page 2017
[14]

End to End Learning for Self-Driving Cars

M. Bojarski, D. Del Testa, D. Dworakowski, B. Firner, B. Flepp, P. Goyal, L. D. Jackel, M. Monfort, U. Muller, J. Zhang et al. , “End to end learning for self-driving cars,” arXiv preprint arXiv:1604.07316 ,

work page internal anchor Pith review Pith/arXiv arXiv
[15]

Review and analysis of solutions of the three point perspective pose estimation problem,

B. M. Haralick, C.-N. Lee, K. Ottenberg, and M. N ¨olle, “Review and analysis of solutions of the three point perspective pose estimation problem,” Int. J. Computer Vision , vol. 13, no. 3, pp. 331–356, 1994. 2

work page 1994
[16]

Vision and Learning for Deliberative Monocular Cluttered Flight

D. Dey, K. S. Shankar, S. Zeng, R. Mehta, M. T. Agcayazi, C. Eriksen, S. Daftry, M. Hebert, and J. A. Bagnell, “Vision and learning for deliberative monocular cluttered ﬂight,” arXiv preprint arXiv:1411.6326,

work page internal anchor Pith review Pith/arXiv arXiv
[17]

Uav ﬂight experiments applied to the remote sensing of vegetated areas,

E. Salam ´ı, C. Barrado, and E. Pastor, “Uav ﬂight experiments applied to the remote sensing of vegetated areas,” Remote Sensing , vol. 6, no. 11, pp. 11 051–11 081, 2014. 2

work page 2014
[18]

Toward low- ﬂying autonomous MA V trail navigation using deep neural networks for environmental awareness,

N. Smolyanskiy, A. Kamenev, J. Smith, and S. Birchﬁeld, “Toward low- ﬂying autonomous MA V trail navigation using deep neural networks for environmental awareness,” Int. Conf. Intelligent Robots and Systems ,

work page
[19]

Plato: Policy learning using adaptive trajectory optimization,

G. Kahn, T. Zhang, S. Levine, and P. Abbeel, “Plato: Policy learning using adaptive trajectory optimization,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) . IEEE, 2017, pp. 3342–3349. 2

work page 2017
[20]

Maciel-Pearson, P

B. Maciel-Pearson, P. Carbonneau, and T. Breckon, Extending Deep Neural Network Trail Navigation for Unmanned Aerial V ehicle Opera- tion within the F orest Canopy , 2018. 2

work page 2018
[21]

Aggressive Deep Driving: Model Predictive Control with a CNN Cost Model

P. Drews, G. Williams, B. Goldfain, E. A. Theodorou, and J. M. Rehg, “Aggressive deep driving: Model predictive control with a cnn cost model,” arXiv preprint arXiv:1707.05303 , 2017. 2

work page internal anchor Pith review Pith/arXiv arXiv 2017
[22]

CAD2RL: Real Single-Image Flight without a Single Real Image

F. Sadeghi and S. Levine, “Cad2rl: Real single-image ﬂight without a single real image,” arXiv preprint arXiv:1611.04201 , 2016. 2

work page internal anchor Pith review Pith/arXiv arXiv 2016
[23]

Unsupervised learning of depth and ego-motion from video,

T. Zhou, M. Brown, N. Snavely, and D. G. Lowe, “Unsupervised learning of depth and ego-motion from video,” in IEEE Conf. Computer Vision and Pattern Recognition , 2017, pp. 1851–1858. 2

work page 2017
[24]

Demon: Depth and motion network for learning monocular stereo,

B. Ummenhofer, H. Zhou, J. Uhrig, N. Mayer, E. Ilg, A. Dosovitskiy, and T. Brox, “Demon: Depth and motion network for learning monocular stereo,” in IEEE Conf. Computer Vision and Pattern Recognition , 2017, pp. 5038–5047. 2

work page 2017
[25]

Geonet: Unsupervised learning of dense depth, optical ﬂow and camera pose,

Z. Yin and J. Shi, “Geonet: Unsupervised learning of dense depth, optical ﬂow and camera pose,” in IEEE Conf. Computer Vision and Pattern Recognition, 2018, pp. 1983–1992. 2

work page 2018
[26]

Veritatem dies aperit- temporally consistent depth prediction enabled by a multi-task geometric and semantic scene understanding approach,

A. Atapour-Abarghouei and T. P. Breckon, “Veritatem dies aperit- temporally consistent depth prediction enabled by a multi-task geometric and semantic scene understanding approach,” in IEEE Conf. Computer Vision and Pattern Recognition , 2019. 2

work page 2019
[27]

Deep Drone Racing: Learning Agile Flight in Dynamic Environments

E. Kaufmann, A. Loquercio, R. Ranftl, A. Dosovitskiy, V . Koltun, and D. Scaramuzza, “Deep drone racing: Learning agile ﬂight in dynamic environments,” arXiv preprint arXiv:1806.08548 , 2018. 2

work page internal anchor Pith review Pith/arXiv arXiv 2018
[28]

Fast, autonomous ﬂight in gps-denied and cluttered environments,

K. Mohta, M. Watterson, Y . Mulgaonkar, S. Liu, C. Qu, A. Makineni, K. Saulnier, K. Sun, A. Zhu, J. Delmerico et al., “Fast, autonomous ﬂight in gps-denied and cluttered environments,” Journal of Field Robotics , vol. 35, no. 1, pp. 101–120, 2018. 2

work page 2018
[29]

Perception, guidance, and navigation for indoor autonomous drone racing using deep learning,

S. Jung, S. Hwang, H. Shin, and D. H. Shim, “Perception, guidance, and navigation for indoor autonomous drone racing using deep learning,” IEEE Robotics and Automation Letters , vol. 3, no. 3, pp. 2539–2544,

work page
[30]

UAS navigation with SqueezePoseNetaccu- racy boosting for pose regression by data augmentation,

M. S. Mueller and B. Jutzi, “UAS navigation with SqueezePoseNetaccu- racy boosting for pose regression by data augmentation,” Drones, vol. 2, no. 1, p. 7, 2018. 2

work page 2018
[31]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997. 2

work page 1997
[32]

Real shading in Unreal engine 4,

B. Karis and E. Games, “Real shading in Unreal engine 4,” Physically Based Shading Theory Practice , vol. 4, 2013. 2

work page 2013
[33]

Representing attitude: Euler angles, unit quaternions, and rotation vectors,

J. Diebel, “Representing attitude: Euler angles, unit quaternions, and rotation vectors,” Matrix, vol. 58, no. 15-16, pp. 1–35, 2006. 2

work page 2006
[34]

Novi commentarii academiae scientiarum petropolitanae,

L. Euler, “Novi commentarii academiae scientiarum petropolitanae,”

work page
[35]

Full quaternion based attitude control for a quadrotor,

E. Fresk and G. Nikolakopoulos, “Full quaternion based attitude control for a quadrotor,” in Euro. Control Conference. IEEE, 2013, pp. 3864–

work page 2013
[36]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980 , 2014. 4

work page internal anchor Pith review Pith/arXiv arXiv 2014
[37]

Geo-fencing for unmanned aerial vehicle,

P. Pratyusha and V . Naidu, “Geo-fencing for unmanned aerial vehicle,” Int. J. Computer Applications , 2013. 6

work page 2013
[38]

Safe visual navigation via deep learning and novelty detection,

C. Richter and N. Roy, “Safe visual navigation via deep learning and novelty detection,” 2017. 7

work page 2017
[39]

Grad-cam: Visual explanations from deep networks via gradient-based localization,

R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-cam: Visual explanations from deep networks via gradient-based localization,” in Proceedings of the IEEE International Conference on Computer Vision , 2017, pp. 618–626. 7

work page 2017

[1] [1]

A survey of unmanned aerial vehicle (UA V) usage for imagery collection in disaster research and manage- ment,

S. M. Adams and C. J. Friedland, “A survey of unmanned aerial vehicle (UA V) usage for imagery collection in disaster research and manage- ment,” in Int. Workshop on Remote Sensing for Disaster Response , 2011, p. 8. 1

work page 2011

[2] [2]

Adapting open-source drone autopilots for real-time iceberg observations,

D. F. Carlson and S. Rysgaard, “Adapting open-source drone autopilots for real-time iceberg observations,” MethodsX, vol. 5, pp. 1059–1072,

work page

[3] [3]

The potential use of unmanned aircraft systems (drones) in mountain search and rescue operations,

Y . Karaca, M. Cicek, O. Tatli, A. Sahin, S. Pasli, M. F. Beser, and S. Turedi, “The potential use of unmanned aircraft systems (drones) in mountain search and rescue operations,” American J. Emergency Medicine, vol. 36, no. 4, pp. 583–588, 2018. 1

work page 2018

[4] [4]

Forestry applications of UA Vs in Europe: A review,

C. Torresan, A. Berton, F. Carotenuto, S. Filippo Di Gennaro, B. Gioli, A. Matese, F. Miglietta, C. Vagnoli, A. Zaldei, and L. Wallace, “Forestry applications of UA Vs in Europe: A review,” International Journal of Remote Sensing , no. 38, pp. 2427–2447, 2017. 1, 2

work page 2017

[5] [5]

Y . B. Sebbane, Intelligent Autonomy of UA Vs: Advanced Missions and Future Use. Chapman and Hall/CRC, 2018. 1

work page 2018

[6] [6]

Survey on computer vision for UA Vs: Current developments and trends,

C. Kanellakis and G. Nikolakopoulos, “Survey on computer vision for UA Vs: Current developments and trends,” J. Intelligent & Robotic Systems, vol. 87, no. 1, pp. 141–168, 2017. 1, 2

work page 2017

[7] [7]

An architecture for robust UA V navigation in GPS-denied areas,

F. J. Perez-Grau, R. Ragel, F. Caballero, A. Viguria, and A. Ollero, “An architecture for robust UA V navigation in GPS-denied areas,” J. Field Robotics, vol. 35, no. 1, pp. 121–145, 2018. 1, 2

work page 2018

[8] [8]

Simulation tools, environments and frameworks for UA V systems performance analysis,

A. I. Hentati, L. Krichen, M. Fourati, and L. C. Fourati, “Simulation tools, environments and frameworks for UA V systems performance analysis,” in Int. Conf. Wireless Communications & Mobile Computing . IEEE, 2018, pp. 1495–1500. 1

work page 2018

[9] [9]

AirSim: High-ﬁdelity visual and physical simulation for autonomous vehicles,

S. Shah, D. Dey, C. Lovett, and A. Kapoor, “AirSim: High-ﬁdelity visual and physical simulation for autonomous vehicles,” in Field and Service Robotics, 2017, pp. 621–635. 1

work page 2017

[10] [10]

Airsim: High-ﬁdelity visual and physical simulation for au- tonomous vehicles,

——, “Airsim: High-ﬁdelity visual and physical simulation for au- tonomous vehicles,” in Field and Service Robotics . Springer, 2018, pp. 621–635. 1, 2

work page 2018

[11] [11]

FlyMASTER: Multi-UA V control and supervision with ROS,

A. P. Lamping, J. N. Ouwerkerk, N. O. Stockton, K. Cohen, M. Kumar, and D. W. Casbeer, “FlyMASTER: Multi-UA V control and supervision with ROS,” in Aviation Technology, Integration, and Operations Con- ference, 2018. 1

work page 2018

[12] [12]

Posenet: A convolutional network for real-time 6-DOF camera relocalization,

A. Kendall, M. Grimes, and R. Cipolla, “Posenet: A convolutional network for real-time 6-DOF camera relocalization,” in Int. Conf. Computer Vision, 2015, pp. 2938–2946. 2, 3, 4, 5, 6, 7

work page 2015

[13] [13]

DeepVO: Towards end-to- end visual odometry with deep recurrent convolutional neural networks,

S. Wang, R. Clark, H. Wen, and N. Trigoni, “DeepVO: Towards end-to- end visual odometry with deep recurrent convolutional neural networks,” in Int. Conf. Robotics and Automation . IEEE, 2017, pp. 2043–2050. 2, 3, 4, 5, 6, 7

work page 2017

[14] [14]

End to End Learning for Self-Driving Cars

M. Bojarski, D. Del Testa, D. Dworakowski, B. Firner, B. Flepp, P. Goyal, L. D. Jackel, M. Monfort, U. Muller, J. Zhang et al. , “End to end learning for self-driving cars,” arXiv preprint arXiv:1604.07316 ,

work page internal anchor Pith review Pith/arXiv arXiv

[15] [15]

Review and analysis of solutions of the three point perspective pose estimation problem,

B. M. Haralick, C.-N. Lee, K. Ottenberg, and M. N ¨olle, “Review and analysis of solutions of the three point perspective pose estimation problem,” Int. J. Computer Vision , vol. 13, no. 3, pp. 331–356, 1994. 2

work page 1994

[16] [16]

Vision and Learning for Deliberative Monocular Cluttered Flight

D. Dey, K. S. Shankar, S. Zeng, R. Mehta, M. T. Agcayazi, C. Eriksen, S. Daftry, M. Hebert, and J. A. Bagnell, “Vision and learning for deliberative monocular cluttered ﬂight,” arXiv preprint arXiv:1411.6326,

work page internal anchor Pith review Pith/arXiv arXiv

[17] [17]

Uav ﬂight experiments applied to the remote sensing of vegetated areas,

E. Salam ´ı, C. Barrado, and E. Pastor, “Uav ﬂight experiments applied to the remote sensing of vegetated areas,” Remote Sensing , vol. 6, no. 11, pp. 11 051–11 081, 2014. 2

work page 2014

[18] [18]

Toward low- ﬂying autonomous MA V trail navigation using deep neural networks for environmental awareness,

N. Smolyanskiy, A. Kamenev, J. Smith, and S. Birchﬁeld, “Toward low- ﬂying autonomous MA V trail navigation using deep neural networks for environmental awareness,” Int. Conf. Intelligent Robots and Systems ,

work page

[19] [19]

Plato: Policy learning using adaptive trajectory optimization,

G. Kahn, T. Zhang, S. Levine, and P. Abbeel, “Plato: Policy learning using adaptive trajectory optimization,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) . IEEE, 2017, pp. 3342–3349. 2

work page 2017

[20] [20]

Maciel-Pearson, P

B. Maciel-Pearson, P. Carbonneau, and T. Breckon, Extending Deep Neural Network Trail Navigation for Unmanned Aerial V ehicle Opera- tion within the F orest Canopy , 2018. 2

work page 2018

[21] [21]

Aggressive Deep Driving: Model Predictive Control with a CNN Cost Model

P. Drews, G. Williams, B. Goldfain, E. A. Theodorou, and J. M. Rehg, “Aggressive deep driving: Model predictive control with a cnn cost model,” arXiv preprint arXiv:1707.05303 , 2017. 2

work page internal anchor Pith review Pith/arXiv arXiv 2017

[22] [22]

CAD2RL: Real Single-Image Flight without a Single Real Image

F. Sadeghi and S. Levine, “Cad2rl: Real single-image ﬂight without a single real image,” arXiv preprint arXiv:1611.04201 , 2016. 2

work page internal anchor Pith review Pith/arXiv arXiv 2016

[23] [23]

Unsupervised learning of depth and ego-motion from video,

T. Zhou, M. Brown, N. Snavely, and D. G. Lowe, “Unsupervised learning of depth and ego-motion from video,” in IEEE Conf. Computer Vision and Pattern Recognition , 2017, pp. 1851–1858. 2

work page 2017

[24] [24]

Demon: Depth and motion network for learning monocular stereo,

B. Ummenhofer, H. Zhou, J. Uhrig, N. Mayer, E. Ilg, A. Dosovitskiy, and T. Brox, “Demon: Depth and motion network for learning monocular stereo,” in IEEE Conf. Computer Vision and Pattern Recognition , 2017, pp. 5038–5047. 2

work page 2017

[25] [25]

Geonet: Unsupervised learning of dense depth, optical ﬂow and camera pose,

Z. Yin and J. Shi, “Geonet: Unsupervised learning of dense depth, optical ﬂow and camera pose,” in IEEE Conf. Computer Vision and Pattern Recognition, 2018, pp. 1983–1992. 2

work page 2018

[26] [26]

Veritatem dies aperit- temporally consistent depth prediction enabled by a multi-task geometric and semantic scene understanding approach,

A. Atapour-Abarghouei and T. P. Breckon, “Veritatem dies aperit- temporally consistent depth prediction enabled by a multi-task geometric and semantic scene understanding approach,” in IEEE Conf. Computer Vision and Pattern Recognition , 2019. 2

work page 2019

[27] [27]

Deep Drone Racing: Learning Agile Flight in Dynamic Environments

E. Kaufmann, A. Loquercio, R. Ranftl, A. Dosovitskiy, V . Koltun, and D. Scaramuzza, “Deep drone racing: Learning agile ﬂight in dynamic environments,” arXiv preprint arXiv:1806.08548 , 2018. 2

work page internal anchor Pith review Pith/arXiv arXiv 2018

[28] [28]

Fast, autonomous ﬂight in gps-denied and cluttered environments,

K. Mohta, M. Watterson, Y . Mulgaonkar, S. Liu, C. Qu, A. Makineni, K. Saulnier, K. Sun, A. Zhu, J. Delmerico et al., “Fast, autonomous ﬂight in gps-denied and cluttered environments,” Journal of Field Robotics , vol. 35, no. 1, pp. 101–120, 2018. 2

work page 2018

[29] [29]

Perception, guidance, and navigation for indoor autonomous drone racing using deep learning,

S. Jung, S. Hwang, H. Shin, and D. H. Shim, “Perception, guidance, and navigation for indoor autonomous drone racing using deep learning,” IEEE Robotics and Automation Letters , vol. 3, no. 3, pp. 2539–2544,

work page

[30] [30]

UAS navigation with SqueezePoseNetaccu- racy boosting for pose regression by data augmentation,

M. S. Mueller and B. Jutzi, “UAS navigation with SqueezePoseNetaccu- racy boosting for pose regression by data augmentation,” Drones, vol. 2, no. 1, p. 7, 2018. 2

work page 2018

[31] [31]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural computation, vol. 9, no. 8, pp. 1735–1780, 1997. 2

work page 1997

[32] [32]

Real shading in Unreal engine 4,

B. Karis and E. Games, “Real shading in Unreal engine 4,” Physically Based Shading Theory Practice , vol. 4, 2013. 2

work page 2013

[33] [33]

Representing attitude: Euler angles, unit quaternions, and rotation vectors,

J. Diebel, “Representing attitude: Euler angles, unit quaternions, and rotation vectors,” Matrix, vol. 58, no. 15-16, pp. 1–35, 2006. 2

work page 2006

[34] [34]

Novi commentarii academiae scientiarum petropolitanae,

L. Euler, “Novi commentarii academiae scientiarum petropolitanae,”

work page

[35] [35]

Full quaternion based attitude control for a quadrotor,

E. Fresk and G. Nikolakopoulos, “Full quaternion based attitude control for a quadrotor,” in Euro. Control Conference. IEEE, 2013, pp. 3864–

work page 2013

[36] [36]

Adam: A Method for Stochastic Optimization

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980 , 2014. 4

work page internal anchor Pith review Pith/arXiv arXiv 2014

[37] [37]

Geo-fencing for unmanned aerial vehicle,

P. Pratyusha and V . Naidu, “Geo-fencing for unmanned aerial vehicle,” Int. J. Computer Applications , 2013. 6

work page 2013

[38] [38]

Safe visual navigation via deep learning and novelty detection,

C. Richter and N. Roy, “Safe visual navigation via deep learning and novelty detection,” 2017. 7

work page 2017

[39] [39]

Grad-cam: Visual explanations from deep networks via gradient-based localization,

R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-cam: Visual explanations from deep networks via gradient-based localization,” in Proceedings of the IEEE International Conference on Computer Vision , 2017, pp. 618–626. 7

work page 2017