pith. sign in

arxiv: 2605.17865 · v1 · pith:VVGYSCWTnew · submitted 2026-05-18 · 💻 cs.CV

Imaging Hidden Objects with Consumer LiDAR via Motion Induced Sampling

Pith reviewed 2026-05-20 12:01 UTC · model grok-4.3

classification 💻 cs.CV
keywords NLOS imagingconsumer LiDARmotion-induced samplingnon-line-of-sight3D reconstructionobject trackingcamera localizationmulti-frame fusion
0
0 comments X

The pith

Motion turns consumer LiDAR into a tool for seeing hidden objects in 3D.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that LiDAR sensors already present in many smartphones can recover information about objects hidden from direct view. It does so by introducing a model that accounts for how motion of either the sensor or the target creates additional sampling opportunities across successive frames. This fusion approach compensates for the weak signals and coarse resolution typical of low-cost hardware. A sympathetic reader would care because the demonstrations show that previously lab-only capabilities for hidden-scene recovery can now run on devices costing under one hundred dollars with no extra calibration. The results cover shape recovery, tracking of concealed items, and determining the sensor's own location from indirect signals.

Core claim

The paper claims that a motion-induced aperture sampling model unifies object shape, object motion, and camera motion under one measurement model. This unification supports effective fusion of multiple low-quality frames captured by a smartphone-grade LiDAR. The resulting system performs three-dimensional reconstruction of hidden scenes, tracks single and multiple hidden objects, and localizes the camera itself using signals from concealed references.

What carries the argument

Motion-induced aperture sampling model: a unified measurement model that folds object shape, object motion, and camera motion together to enable multi-frame fusion despite low signal quality.

If this is right

  • Three-dimensional shapes of hidden objects can be recovered from standard consumer LiDAR hardware.
  • Tracking becomes possible for both single and multiple objects that remain outside the direct line of sight.
  • The imaging device can determine its own position by using hidden objects as indirect references.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same motion-based fusion principle might improve other low-cost sensors that suffer from sparse or noisy returns.
  • Real-time versions could support robotic navigation that accounts for spaces behind occluders without adding hardware.
  • Field tests with natural motions and varying surface reflectances would show how far the current demonstrations generalize.

Load-bearing premise

That motion of the object or camera creates a consistent and modelable sampling pattern sufficient to fuse noisy signals into usable reconstructions of hidden geometry.

What would settle it

Run a controlled scan of a known hidden object shape with a moving smartphone LiDAR and check whether the output 3D model matches independent ground-truth measurements of that shape within the expected error bounds.

Figures

Figures reproduced from arXiv: 2605.17865 by Aaron Young, Adithya Pediredla, Akshat Dave, Ramesh Raskar, Siddharth Somasundaram.

Figure 1
Figure 1. Figure 1: Consumer non-line-of-sight imaging. (a) NLOS imaging is possible by turning nearby diffuse surfaces into virtual mirrors that reveal hidden objects. However, NLOS imaging with consumer LiDARs is challenging due to low SNR, the tradeoff between virtual aperture size and sampling density, and joint object and camera motion. We address these challenges using multi-frame fusion. (b) We demonstrate three consum… view at source ↗
Figure 2
Figure 2. Figure 2: Motion-induced aperture sampling model. The space-time measurements can be expressed as the combined effect of object shape, object motion, and camera pose using the convolutional property of the light-cone transform (LCT). LCT aligns scene space and measurement space by transforming depth in scene space as vz = z 2 and time in measurement space as vτ = (cτ /2)2 . where Rτ and Rz are non-linear resampling … view at source ↗
Figure 3
Figure 3. Figure 3: Particle filtering. Particle filtering consists of three steps: particle propagation, evaluation, and resampling. Propagation uses a motion prior and evaluation updates the prior with the data likelihood. moves out of view [51, 52]. In robotics, camera localization assumes that the environment is known when recovering the camera position [53]. Motivated by such applications, we outline a technique for real… view at source ↗
Figure 4
Figure 4. Figure 4: Tracking multiple hidden objects. (a) We can detect and track multiple objects. In this result, we show tracking of one moving object on a translation stage and one static object. The first few frames are localizing the object, and are therefore omitted from the visualization. (b) We demonstrate NLOS hand tracking as an application of multi-object tracking. The predicted right and left hand distributions a… view at source ↗
Figure 5
Figure 5. Figure 5: Imaging hidden diffuse objects. Although the MAS model we propose was derived assuming retroreflective object reflectance, we find empirically that the model can handle diffuse objects. We show that we are still able to reconstruct 3D shape and track objects in real-time (available as supplementary video). The results, however, are inherently worse than those obtained with retroreflective objects because o… view at source ↗
Figure 6
Figure 6. Figure 6: Ill-Posed Tracking with Small Objects and Small Synthetic Aperture. We use the keyhole imaging configuration as an example, where the synthetic aperture is just a single point, and the target hidden object is a point. Under this scenario, the object position cannot be well-constrained even if information is aggregated across multiple frames. position of an extended object, we can also leverage the spatial … view at source ↗
Figure 7
Figure 7. Figure 7: Baseline Tracking Result. We compare our tracking technique to a baseline technique of com￾puting the filtered backprojection reconstruction of the hidden volume then computing the maximum value of the volume. We find that our particle filtering approach is more robust to noise and more computationally efficient. outright fail, the tracking results are substantially noisier because they don’t leverage any … view at source ↗
Figure 8
Figure 8. Figure 8: Real-Time Tracking with ST VL53L8CX SPAD. We demonstrate NLOS imaging with off￾the-shelf hardware and minimal calibration. Available as video result. This preprint has not undergone post-submission improvements or corrections. The Version of Record of this article is published in Nature, and made available online at https://www.nature.com/articles/s41586-026-10502-x [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Quantitative Tracking Results. The x − y trajectory of a hidden patch is shown. We observe that the tracking error is strongly correlated to the relative position of the object and virtual aperture. We validate this relationship by observing the similar structure between the error map and the aperture distance map, which computes the distance of the object to the aperture’s centroid. the camera is imaging … view at source ↗
Figure 10
Figure 10. Figure 10: Real-Time Handheld Camera Localization. The camera has unstructured 5D motion and is observing a textureless wall, which would be challenging for conventional odometry techniques such as ICP. However, using the hidden object as a visual cue, we are able to recover the camera’s trajectory. Available as video result. This preprint has not undergone post-submission improvements or corrections. The Version of… view at source ↗
read the original abstract

LiDARs are being increasingly deployed for consumer imaging in handheld, wearable, and robotic applications. These sensors can capture the time-of-flight of light at picosecond resolution, which in principle, enables them to capture information about objects hidden from their field of view. While such non-line-of-sight (NLOS) imaging capabilities have been shown on research-grade LiDARs, they are challenging to achieve on consumer devices due to poor signal quality resulting from low laser power, low spatial resolution, and object and camera motion. Inspired by burst photography and synthetic aperture radar, we propose a multi-frame fusion strategy to overcome these challenges and demonstrate NLOS imaging on consumer LiDAR. We first introduce the motion-induced aperture sampling model to unify the effects of object shape, object motion, and camera motion under a single measurement model. Using this model, we demonstrate several NLOS capabilities on a smartphone-grade LiDAR: (1) 3D reconstruction, (2) single and multi-object tracking, and (3) camera localization using hidden objects. Previously, NLOS imaging capabilities were largely restricted to bulky and expensive research-grade hardware that requires extensive setup and calibration. Our results represent a shift towards plug-and-play NLOS imaging, where anyone can image hidden objects with off-the-shelf hardware ($<100) and no additional setup. We believe that democratization of such capabilities will advance consumer applications of NLOS imaging.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces a motion-induced aperture sampling model that unifies the effects of hidden object shape, object motion, and camera motion into a single forward measurement model. This model underpins a multi-frame fusion strategy to enable non-line-of-sight (NLOS) imaging on consumer-grade LiDAR despite low laser power, coarse resolution, and motion. The authors demonstrate three capabilities on smartphone LiDAR hardware: (1) 3D reconstruction of hidden objects, (2) single- and multi-object tracking, and (3) camera localization using hidden objects, positioning the work as a step toward plug-and-play NLOS imaging with off-the-shelf devices under $100 and no extra setup.

Significance. If the central claims hold under the reported hardware constraints, the work would be significant for democratizing NLOS imaging beyond research-grade systems. The unified motion-induced sampling model offers a conceptual advance by treating motion as a constructive sampling mechanism rather than a nuisance, extending ideas from burst photography and synthetic aperture radar to time-of-flight sensors. Explicit credit is due for the practical demonstration on accessible hardware and the three distinct application scenarios, which together suggest a pathway to consumer robotics and AR uses. The absence of quantitative error analysis in the provided abstract, however, leaves the robustness of the inverse problem under low-photon conditions as an open question for the full manuscript.

major comments (2)
  1. [Motion-induced aperture sampling model (methods section)] The motion-induced aperture sampling model (introduced after the abstract and in the methods) is described as unifying object shape, object velocity, and camera trajectory. It is unclear whether the formulation treats motion parameters as known inputs or performs joint estimation with the hidden geometry. Under the low signal-to-noise regime of consumer LiDAR (few photons per frame, coarse angular sampling), even modest velocity or trajectory errors can alias into shape artifacts; without an explicit conditioning analysis or sensitivity study of the inverse problem, it is difficult to confirm that multi-frame fusion reliably separates these factors.
  2. [Experimental results] Section on experimental results: the three claimed capabilities (3D reconstruction, tracking, camera localization) are presented via qualitative visualizations, but the manuscript provides no quantitative metrics (e.g., reconstruction RMSE, tracking precision, localization error), error bars, or comparisons against ground truth or alternative fusion baselines. This omission directly affects the ability to verify that the proposed model overcomes the poor signal quality highlighted in the abstract.
minor comments (2)
  1. [Abstract] The abstract states 'no additional setup' yet the model necessarily requires some knowledge or estimation of motion; a brief clarification of minimal assumptions would improve readability.
  2. [Methods] Notation for the forward model (e.g., variables for shape, velocity, and trajectory) is introduced late; defining it earlier would aid readers in following the unification argument.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback and for recognizing the potential significance of our work in enabling NLOS imaging on consumer-grade hardware. We address each major comment below and describe the revisions planned for the manuscript.

read point-by-point responses
  1. Referee: [Motion-induced aperture sampling model (methods section)] The motion-induced aperture sampling model (introduced after the abstract and in the methods) is described as unifying object shape, object velocity, and camera trajectory. It is unclear whether the formulation treats motion parameters as known inputs or performs joint estimation with the hidden geometry. Under the low signal-to-noise regime of consumer LiDAR (few photons per frame, coarse angular sampling), even modest velocity or trajectory errors can alias into shape artifacts; without an explicit conditioning analysis or sensitivity study of the inverse problem, it is difficult to confirm that multi-frame fusion reliably separates these factors.

    Authors: The motion-induced aperture sampling model is formulated with motion parameters (object velocity and camera trajectory) as known inputs, sourced from the smartphone LiDAR's built-in IMU and visual-inertial odometry. The inverse problem then recovers the hidden geometry via multi-frame fusion under this forward model; joint estimation of motion and shape is not performed in the current implementation to maintain tractability given the low-photon regime. We agree that a sensitivity analysis is important for validating robustness and will add this to the revised methods section, including both simulated perturbations of motion estimates and real-data experiments showing reconstruction degradation within typical consumer-device motion error bounds. revision: yes

  2. Referee: [Experimental results] Section on experimental results: the three claimed capabilities (3D reconstruction, tracking, camera localization) are presented via qualitative visualizations, but the manuscript provides no quantitative metrics (e.g., reconstruction RMSE, tracking precision, localization error), error bars, or comparisons against ground truth or alternative fusion baselines. This omission directly affects the ability to verify that the proposed model overcomes the poor signal quality highlighted in the abstract.

    Authors: We acknowledge that quantitative metrics strengthen verification of the claims, particularly under low signal quality. The current manuscript emphasizes qualitative demonstrations to illustrate real-world feasibility on off-the-shelf hardware, where obtaining precise ground truth for fully hidden NLOS objects is challenging. We will revise the experimental results section to incorporate quantitative evaluations, including RMSE for 3D reconstruction in controlled and semi-synthetic setups, tracking precision and localization error metrics (with error bars from repeated trials), and direct comparisons against single-frame baselines and alternative fusion strategies. revision: yes

Circularity Check

0 steps flagged

Motion-induced aperture model introduced as physically derived unification without reduction to inputs

full rationale

The paper presents the motion-induced aperture sampling model as a new forward model that unifies object shape, object motion, and camera motion under a single measurement equation, explicitly inspired by burst photography and synthetic aperture radar rather than fitted to the NLOS reconstruction targets. No equations in the provided text reduce the model parameters or fusion step to the final 3D reconstruction, tracking, or localization outputs by construction. The central claim of plug-and-play NLOS on consumer hardware rests on this physically motivated multi-frame strategy and low-signal robustness, which remains independent of self-citation chains or ansatz smuggling in the abstract and description. This is the most common honest finding for papers that introduce a new measurement model grounded outside the target result.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 1 invented entities

Only the abstract is available, so specific free parameters, axioms, and invented entities cannot be extracted in detail. The primary new modeling element appears to be the motion-induced aperture sampling model.

invented entities (1)
  • motion-induced aperture sampling model no independent evidence
    purpose: Unify effects of object shape, object motion, and camera motion under one measurement model for multi-frame NLOS fusion
    Presented as the key technical contribution that enables the consumer LiDAR demonstrations.

pith-pipeline@v0.9.0 · 5796 in / 1190 out tokens · 60727 ms · 2026-05-20T12:01:54.365114+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

69 extracted references · 69 canonical work pages · 2 internal anchors

  1. [1]

    See through walls with wifi!,

    F. Adib and D. Katabi, “See through walls with wifi!,” inProceedings of the ACM SIGCOMM 2013 conference on SIGCOMM, pp. 75–86, 2013

  2. [2]

    Cornerradar: Rf-based indoor localization around corners,

    S. Yue, H. He, P. Cao, K. Zha, M. Koizumi, and D. Katabi, “Cornerradar: Rf-based indoor localization around corners,”Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, vol. 6, no. 1, pp. 1–24, 2022

  3. [3]

    Seeing around street corners: Non-line-of-sight detection and tracking in-the-wild using doppler radar,

    N. Scheiner, F. Kraus, F. Wei, B. Phan, F. Mannan, N. Appenrodt, W. Ritter, J. Dickmann, K. Diet- mayer, B. Sick,et al., “Seeing around street corners: Non-line-of-sight detection and tracking in-the-wild using doppler radar,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2068–2077, 2020

  4. [4]

    Thermal non-line-of-sight imaging,

    T. Maeda, Y. Wang, R. Raskar, and A. Kadambi, “Thermal non-line-of-sight imaging,” in2019 IEEE International Conference on Computational Photography (ICCP), pp. 1–11, IEEE, 2019

  5. [5]

    Acoustic non-line-of-sight imaging,

    D. B. Lindell, G. Wetzstein, and V. Koltun, “Acoustic non-line-of-sight imaging,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6780–6789, 2019

  6. [6]

    Fast non- line-of-sight imaging with high-resolution and wide field of view using synthetic wavelength holography,

    F. Willomitzer, P. V. Rangarajan, F. Li, M. M. Balaji, M. P. Christensen, and O. Cossairt, “Fast non- line-of-sight imaging with high-resolution and wide field of view using synthetic wavelength holography,” Nature communications, vol. 12, no. 1, p. 6647, 2021. This preprint has not undergone post-submission improvements or corrections. The Version of Rec...

  7. [7]

    Passive sensing around the corner using spatial coherence,

    M. Batarseh, S. Sukhov, Z. Shen, H. Gemar, R. Rezvani, and A. Dogariu, “Passive sensing around the corner using spatial coherence,”Nature communications, vol. 9, no. 1, p. 3629, 2018

  8. [8]

    Looking through walls and around corners,

    I. Freund, “Looking through walls and around corners,”Physica A: Statistical Mechanics and its Ap- plications, vol. 168, no. 1, pp. 49–65, 1990

  9. [9]

    Non-invasive single-shot imaging through scattering layers and around corners via speckle correlations,

    O. Katz, P. Heidmann, M. Fink, and S. Gigan, “Non-invasive single-shot imaging through scattering layers and around corners via speckle correlations,”Nature photonics, vol. 8, no. 10, pp. 784–790, 2014

  10. [10]

    Tracking multiple objects outside the line of sight using speckle imaging,

    B. M. Smith, M. O’Toole, and M. Gupta, “Tracking multiple objects outside the line of sight using speckle imaging,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6258–6266, 2018

  11. [11]

    Turning corners into cameras: Principles and methods,

    K. L. Bouman, V. Ye, A. B. Yedidia, F. Durand, G. W. Wornell, A. Torralba, and W. T. Freeman, “Turning corners into cameras: Principles and methods,” inProceedings of the IEEE International Conference on Computer Vision, pp. 2270–2278, 2017

  12. [12]

    Computational periscopy with an ordinary digital camera,

    C. Saunders, J. Murray-Bruce, and V. K. Goyal, “Computational periscopy with an ordinary digital camera,”Nature, vol. 565, no. 7740, pp. 472–475, 2019

  13. [13]

    Polarized non-line-of-sight imaging,

    K. Tanaka, Y. Mukaigawa, and A. Kadambi, “Polarized non-line-of-sight imaging,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2136–2145, 2020

  14. [14]

    Looking around the corner using transient imag- ing,

    A. Kirmani, T. Hutchison, J. Davis, and R. Raskar, “Looking around the corner using transient imag- ing,” in2009 IEEE 12th International Conference on Computer Vision, pp. 159–166, IEEE, 2009

  15. [15]

    Recovering three-dimensional shape around a corner using ultrafast time-of-flight imaging,

    A. Velten, T. Willwacher, O. Gupta, A. Veeraraghavan, M. G. Bawendi, and R. Raskar, “Recovering three-dimensional shape around a corner using ultrafast time-of-flight imaging,”Nature communications, vol. 3, no. 1, p. 745, 2012

  16. [16]

    Confocal non-line-of-sight imaging based on the light-cone transform,

    M. O’Toole, D. B. Lindell, and G. Wetzstein, “Confocal non-line-of-sight imaging based on the light-cone transform,”Nature, vol. 555, no. 7696, pp. 338–341, 2018

  17. [17]

    Wave-based non-line-of-sight imaging using fast fk migration,

    D. B. Lindell, G. Wetzstein, and M. O’Toole, “Wave-based non-line-of-sight imaging using fast fk migration,”ACM Transactions on Graphics (ToG), vol. 38, no. 4, pp. 1–13, 2019

  18. [18]

    Non-line-of-sight imaging using phasor-field virtual wave optics,

    X. Liu, I. Guill´ en, M. La Manna, J. H. Nam, S. A. Reza, T. Huu Le, A. Jarabo, D. Gutierrez, and A. Velten, “Non-line-of-sight imaging using phasor-field virtual wave optics,”Nature, vol. 572, no. 7771, pp. 620–623, 2019

  19. [19]

    Phasor field diffraction based reconstruction for fast non-line-of-sight imaging systems,

    X. Liu, S. Bauer, and A. Velten, “Phasor field diffraction based reconstruction for fast non-line-of-sight imaging systems,”Nature communications, vol. 11, no. 1, p. 1645, 2020

  20. [20]

    Diffuse mirrors: 3d reconstruction from diffuse indirect illumination using inexpensive time-of-flight sensors,

    F. Heide, L. Xiao, W. Heidrich, and M. B. Hullin, “Diffuse mirrors: 3d reconstruction from diffuse indirect illumination using inexpensive time-of-flight sensors,” inProceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3222–3229, 2014

  21. [21]

    Femto-photography: capturing and visualizing the propagation of light,

    A. Velten, D. Wu, A. Jarabo, B. Masia, C. Barsi, C. Joshi, E. Lawson, M. Bawendi, D. Gutierrez, and R. Raskar, “Femto-photography: capturing and visualizing the propagation of light,”ACM Transactions on Graphics (ToG), vol. 32, no. 4, pp. 1–8, 2013

  22. [22]

    Non-line-of-sight imaging using a time- gated single photon avalanche diode,

    M. Buttafava, J. Zeman, A. Tosi, K. Eliceiri, and A. Velten, “Non-line-of-sight imaging using a time- gated single photon avalanche diode,”Optics express, vol. 23, no. 16, pp. 20997–21011, 2015

  23. [23]

    A megapixel time-gated spad image sensor for 2d and 3d imaging applications,

    K. Morimoto, A. Ardelean, M.-L. Wu, A. C. Ulku, I. M. Antolovic, C. Bruschini, and E. Char- bon, “A megapixel time-gated spad image sensor for 2d and 3d imaging applications,”arXiv preprint arXiv:1912.12910, 2019

  24. [24]

    3.2 megapixel 3d-stacked charge focusing spad for low-light imaging and depth sensing,

    K. Morimoto, J. Iwata, M. Shinohara, H. Sekine, A. Abdelghafar, H. Tsuchiya, Y. Kuroda, K. Tojima, W. Endo, Y. Maehashi,et al., “3.2 megapixel 3d-stacked charge focusing spad for low-light imaging and depth sensing,” in2021 IEEE International Electron Devices Meeting (IEDM), pp. 20–2, IEEE, 2021. This preprint has not undergone post-submission improvement...

  25. [25]

    A 1.8-um pitch, 47-ps jitter spad array in 130nm sige bicmos process,

    F. Liu and E. Charbon, “A 1.8-um pitch, 47-ps jitter spad array in 130nm sige bicmos process,”arXiv preprint arXiv:2406.13686, 2024

  26. [26]

    Single-photon avalanche diode imagers in biophotonics: review and outlook,

    C. Bruschini, H. Homulle, I. M. Antolovic, S. Burri, and E. Charbon, “Single-photon avalanche diode imagers in biophotonics: review and outlook,”Light: Science & Applications, vol. 8, no. 1, p. 87, 2019

  27. [27]

    A 240×160 3d-stacked spad dtof image sensor with rolling shutter and in-pixel histogram for mobile devices,

    C. Zhang, N. Zhang, Z. Ma, L. Wang, Y. Qin, J. Jia, and K. Zang, “A 240×160 3d-stacked spad dtof image sensor with rolling shutter and in-pixel histogram for mobile devices,”IEEE Open Journal of the Solid-State Circuits Society, vol. 2, pp. 3–11, 2021

  28. [28]

    7.3 a 189×600 back-illuminated stacked spad direct time-of-flight depth sensor for automotive lidar systems,

    O. Kumagai, J. Ohmachi, M. Matsumura, S. Yagi, K. Tayu, K. Amagawa, T. Matsukawa, O. Ozawa, D. Hirono, Y. Shinozuka,et al., “7.3 a 189×600 back-illuminated stacked spad direct time-of-flight depth sensor for automotive lidar systems,” in2021 IEEE International Solid-State Circuits Conference (ISSCC), vol. 64, pp. 110–112, IEEE, 2021

  29. [29]

    “Sony semiconductor solutions to release spad depth sensor for smartphones with high-accuracy, low- power distance measurement performance, powered by the industry’s highest photon detection effi- ciency.”https://www.sony-semicon.com/en/news/2023/2023030601.html, 2023

  30. [30]

    The iPhone 12—LiDAR at your fingertips

    S. Rangwala, “The iPhone 12—LiDAR at your fingertips.”https://www.forbes.com/sites/ sabbirrangwala/2020/11/12/the-iphone-12lidar-at-your-fingertips/, 2020

  31. [31]

    LiDAR technology in Apple Vision Pro

    “LiDAR technology in Apple Vision Pro.”https://www.techinsights.com/blog/ lidar-technology-apple-vision-pro, 2024

  32. [32]

    Sony develops a stacked direct time of flight depth sensor for automotive LiDAR with SPAD pixels, an industry first

    “Sony develops a stacked direct time of flight depth sensor for automotive LiDAR with SPAD pixels, an industry first.”https://www.sony.com/en/SonyInfo/News/Press/202102/21-016E/, 2021

  33. [33]

    Why digital LiDAR is the future

    “Why digital LiDAR is the future.”https://ouster.com/insights/blog/ why-digital-lidar-is-the-future, 2020

  34. [34]

    Convolutional ap- proximations to the general non-line-of-sight imaging operator,

    B. Ahn, A. Dave, A. Veeraraghavan, I. Gkioulekas, and A. C. Sankaranarayanan, “Convolutional ap- proximations to the general non-line-of-sight imaging operator,” inProceedings of the IEEE/CVF In- ternational Conference on Computer Vision, pp. 7889–7899, 2019

  35. [35]

    Fpga accelerator for real-time non-line-of-sight imaging,

    Z. Liao, D. Jiang, X. Liu, A. Velten, Y. Ha, and X. Lou, “Fpga accelerator for real-time non-line-of-sight imaging,”IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 69, no. 2, pp. 721–734, 2021

  36. [36]

    A theory of fermat paths for non-line-of-sight shape reconstruction,

    S. Xin, S. Nousias, K. N. Kutulakos, A. C. Sankaranarayanan, S. G. Narasimhan, and I. Gkioulekas, “A theory of fermat paths for non-line-of-sight shape reconstruction,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 6800–6809, 2019

  37. [37]

    Beyond volumetric albedo–a surface optimiza- tion framework for non-line-of-sight imaging,

    C.-Y. Tsai, A. C. Sankaranarayanan, and I. Gkioulekas, “Beyond volumetric albedo–a surface optimiza- tion framework for non-line-of-sight imaging,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 1545–1555, 2019

  38. [38]

    Non-line-of-sight surface recon- struction using the directional light-cone transform,

    S. I. Young, D. B. Lindell, B. Girod, D. Taubman, and G. Wetzstein, “Non-line-of-sight surface recon- struction using the directional light-cone transform,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 1407–1416, 2020

  39. [39]

    Non-line-of-sight imaging via neural transient fields,

    S. Shen, Z. Wang, P. Liu, Z. Pan, R. Li, T. Gao, S. Li, and J. Yu, “Non-line-of-sight imaging via neural transient fields,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 7, pp. 2257–2268, 2021

  40. [40]

    Nlos-neus: Non-line-of-sight neural implicit surface,

    Y. Fujimura, T. Kushida, T. Funatomi, and Y. Mukaigawa, “Nlos-neus: Non-line-of-sight neural implicit surface,”arXiv preprint arXiv:2303.12280, 2023

  41. [41]

    Omni-line- of-sight imaging for holistic shape reconstruction,

    B. Huang, X. Peng, S. Shen, S. Xia, R. Li, Y. Yu, Y. Wang, S. Gao, W. Chen, S. Li,et al., “Omni-line- of-sight imaging for holistic shape reconstruction,”arXiv preprint arXiv:2304.10780, 2023. This preprint has not undergone post-submission improvements or corrections. The Version of Record of this article is published in Nature, and made available online...

  42. [42]

    Learned feature embeddings for non- line-of-sight imaging and recognition,

    W. Chen, F. Wei, K. N. Kutulakos, S. Rusinkiewicz, and F. Heide, “Learned feature embeddings for non- line-of-sight imaging and recognition,”ACM Transactions on Graphics (ToG), vol. 39, no. 6, pp. 1–18, 2020

  43. [43]

    Nlost: Non-line-of-sight imaging with trans- former,

    Y. Li, J. Peng, J. Ye, Y. Zhang, F. Xu, and Z. Xiong, “Nlost: Non-line-of-sight imaging with trans- former,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13313–13322, 2023

  44. [44]

    Steady-state non-line-of-sight imaging,

    W. Chen, S. Daneau, F. Mannan, and F. Heide, “Steady-state non-line-of-sight imaging,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6790–6799, 2019

  45. [45]

    Low-latency time-of-flight non-line-of-sight imaging at 5 frames per second,

    J. H. Nam, E. Brandt, S. Bauer, X. Liu, M. Renna, A. Tosi, E. Sifakis, and A. Velten, “Low-latency time-of-flight non-line-of-sight imaging at 5 frames per second,”Nature communications, vol. 12, no. 1, p. 6526, 2021

  46. [46]

    Burst photography for high dynamic range and low-light imaging on mobile cameras,

    S. W. Hasinoff, D. Sharlet, R. Geiss, A. Adams, J. T. Barron, F. Kainz, J. Chen, and M. Levoy, “Burst photography for high dynamic range and low-light imaging on mobile cameras,”ACM Transactions on Graphics (ToG), vol. 35, no. 6, pp. 1–12, 2016

  47. [47]

    Quanta burst photography,

    S. Ma, S. Gupta, A. C. Ulku, C. Bruschini, E. Charbon, and M. Gupta, “Quanta burst photography,” ACM Transactions on Graphics (TOG), vol. 39, no. 4, pp. 79–1, 2020

  48. [48]

    Some early developments in synthetic aperture radar systems,

    C. W. Sherwin, J. P. Ruina, and R. Rawcliffe, “Some early developments in synthetic aperture radar systems,”IRE Transactions on Military Electronics, no. 2, pp. 111–115, 1962

  49. [49]

    Kinectfusion: real-time 3d reconstruction and interaction using a moving depth camera,

    S. Izadi, D. Kim, O. Hilliges, D. Molyneaux, R. Newcombe, P. Kohli, J. Shotton, S. Hodges, D. Freeman, A. Davison,et al., “Kinectfusion: real-time 3d reconstruction and interaction using a moving depth camera,” inProceedings of the 24th annual ACM symposium on User interface software and technology, pp. 559–568, 2011

  50. [50]

    A volumetric method for building complex models from range images,

    B. Curless and M. Levoy, “A volumetric method for building complex models from range images,” in Proceedings of the 23rd annual conference on Computer graphics and interactive techniques, pp. 303–312, 1996

  51. [51]

    Densefusion: 6d object pose estimation by iterative dense fusion,

    C. Wang, D. Xu, Y. Zhu, R. Mart´ ın-Mart´ ın, C. Lu, L. Fei-Fei, and S. Savarese, “Densefusion: 6d object pose estimation by iterative dense fusion,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3343–3352, 2019

  52. [52]

    PoseCNN: A Convolutional Neural Network for 6D Object Pose Estimation in Cluttered Scenes

    Y. Xiang, T. Schmidt, V. Narayanan, and D. Fox, “Posecnn: A convolutional neural network for 6d object pose estimation in cluttered scenes,”arXiv preprint arXiv:1711.00199, 2017

  53. [53]

    Monte carlo localization for mobile robots,

    F. Dellaert, D. Fox, W. Burgard, and S. Thrun, “Monte carlo localization for mobile robots,” inProceed- ings 1999 IEEE international conference on robotics and automation (Cat. No. 99CH36288C), vol. 2, pp. 1322–1328, IEEE, 1999

  54. [54]

    Reconstruction of hidden 3d shapes using diffuse reflections,

    O. Gupta, T. Willwacher, A. Velten, A. Veeraraghavan, and R. Raskar, “Reconstruction of hidden 3d shapes using diffuse reflections,”Optics express, vol. 20, no. 17, pp. 19096–19108, 2012

  55. [55]

    Bayesian statistics without tears: a sampling–resampling perspective,

    A. F. Smith and A. E. Gelfand, “Bayesian statistics without tears: a sampling–resampling perspective,” The American Statistician, vol. 46, no. 2, pp. 84–88, 1992

  56. [56]

    D. W. Scott,Multivariate density estimation: theory, practice, and visualization. John Wiley & Sons, 2015

  57. [57]

    Non-line-of-sight tracking of people at long range,

    S. Chan, R. E. Warburton, G. Gariepy, J. Leach, and D. Faccio, “Non-line-of-sight tracking of people at long range,”Optics express, vol. 25, no. 9, pp. 10109–10117, 2017

  58. [58]

    Tracking hidden objects with a single-photon camera

    G. Gariepy, F. Tonolini, R. Henderson, J. Leach, and D. Faccio, “Tracking hidden objects with a single-photon camera,”arXiv preprint arXiv:1503.01699, 2015. This preprint has not undergone post-submission improvements or corrections. The Version of Record of this article is published in Nature, and made available online at https://www.nature.com/articles/...

  59. [59]

    Nerfies: Deformable neural radiance fields,

    K. Park, U. Sinha, J. T. Barron, S. Bouaziz, D. B. Goldman, S. M. Seitz, and R. Martin-Brualla, “Nerfies: Deformable neural radiance fields,” inProceedings of the IEEE/CVF international conference on computer vision, pp. 5865–5874, 2021

  60. [60]

    D-nerf: Neural radiance fields for dy- namic scenes,

    A. Pumarola, E. Corona, G. Pons-Moll, and F. Moreno-Noguer, “D-nerf: Neural radiance fields for dy- namic scenes,” inProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10318–10327, 2021

  61. [61]

    Fastslam: A factored solution to the simul- taneous localization and mapping problem,

    M. Montemerlo, S. Thrun, D. Koller, B. Wegbreit,et al., “Fastslam: A factored solution to the simul- taneous localization and mapping problem,”Aaai/iaai, vol. 593598, no. 2, pp. 593–598, 2002

  62. [62]

    Minimalist vision with freeform pixels,

    J. Klotz and S. K. Nayar, “Minimalist vision with freeform pixels,” inEuropean Conference on Computer Vision, pp. 329–346, Springer, 2024

  63. [63]

    Low-power high-performance 8x8 multizone time-of-flight sensor (tof)

    “Low-power high-performance 8x8 multizone time-of-flight sensor (tof).”https://www.st.com/en/ imaging-and-photonics-solutions/vl53l8cx.html, 2021

  64. [64]

    On resampling algorithms for particle filters,

    J. D. Hol, T. B. Schon, and F. Gustafsson, “On resampling algorithms for particle filters,” in2006 IEEE nonlinear statistical signal processing workshop, pp. 79–82, IEEE, 2006

  65. [65]

    Ozdemir,Inverse synthetic aperture radar imaging with MATLAB algorithms

    C. Ozdemir,Inverse synthetic aperture radar imaging with MATLAB algorithms. John Wiley & Sons, 2021

  66. [66]

    Analysis of feature visibility in non-line-of-sight measurements,

    X. Liu, S. Bauer, and A. Velten, “Analysis of feature visibility in non-line-of-sight measurements,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10140– 10148, 2019

  67. [67]

    Keyhole imaging: non-line-of-sight imaging and tracking of moving objects along a single optical path,

    C. A. Metzler, D. B. Lindell, and G. Wetzstein, “Keyhole imaging: non-line-of-sight imaging and tracking of moving objects along a single optical path,”IEEE Transactions on Computational Imaging, vol. 7, pp. 1–12, 2020

  68. [68]

    Single-pixel imaging via compressive sampling,

    M. F. Duarte, M. A. Davenport, D. Takhar, J. N. Laska, T. Sun, K. F. Kelly, and R. G. Baraniuk, “Single-pixel imaging via compressive sampling,”IEEE signal processing magazine, vol. 25, no. 2, pp. 83– 91, 2008

  69. [69]

    Blurred lidar for sharper 3d: Robust handheld 3d scanning with diffuse lidar and rgb,

    N. Behari, A. Young, S. Somasundaram, T. Klinghoffer, A. Dave, and R. Raskar, “Blurred lidar for sharper 3d: Robust handheld 3d scanning with diffuse lidar and rgb,” inProceedings of the Computer Vision and Pattern Recognition Conference, pp. 26954–26964, 2025. This preprint has not undergone post-submission improvements or corrections. The Version of Rec...