pith. sign in

arxiv: 2507.16818 · v1 · submitted 2025-04-30 · 💻 cs.LG

Evaluating Artificial Intelligence Algorithms for the Standardization of Transtibial Prosthetic Socket Shape Design

Pith reviewed 2026-05-22 17:43 UTC · model grok-4.3

classification 💻 cs.LG
keywords artificial intelligenceprosthetic sockettranstibial prosthesisrandom forest3D scanningshape predictionadaptation modelingmachine learning
0
0 comments X

The pith

AI can predict transtibial prosthetic socket shapes from limb scans by learning prosthetist adaptations, with random forest reaching 1.24 mm median error.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether artificial intelligence can reduce reliance on manual, skill-dependent fitting of transtibial prosthetic sockets. It uses 3D scans of residual limbs and the corresponding expert-designed sockets from 118 patients to train models that either directly output the full socket shape or predict the specific adaptations the prosthetist applied to the limb. Adaptation prediction proves more accurate than direct shape prediction for all algorithms tested. The random forest model on adaptations delivers the lowest overall error, measured as median surface-to-surface distance of 1.24 mm with first and third quartiles at 1.03 mm and 1.54 mm. A reader would care because more consistent socket design could improve fit reliability for patients regardless of the individual prosthetist's experience.

Core claim

Estimating the adaptations performed by prosthetists outperforms direct prediction of the final socket shape, and a random forest model applied to this adaptation task produces sockets whose median surface-to-surface distance from the expert designs is 1.24 millimeters.

What carries the argument

Adaptation prediction, in which models learn the modifications a prosthetist makes to a 3D limb scan to arrive at the socket, implemented after PCA-based compression and morphable model alignment.

Load-bearing premise

The sockets designed by prosthetists in the 118-patient dataset represent the appropriate target for standardization rather than one acceptable choice among several valid fits.

What would settle it

A new dataset in which multiple independent prosthetists each produce an acceptable socket for the same limbs; if the AI predictions systematically fall outside the variation among those expert sockets, the claim that the models standardize to an appropriate target would be falsified.

Figures

Figures reproduced from arXiv: 2507.16818 by C.H.E. Jordaan, G.A. de Jong, M. van der Stelt, R. Leijendekkers, T.J.J. Maal, T. Kachman, V.M.A. Stirler.

Figure 1
Figure 1. Figure 1: Example of the used data. A 3D scan of a stump (left) and the corresponding 3D model of the socket [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Three main challenges of applying artificial intelligence on point clouds/meshes [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The five-step digital pre-processing workflow, consisting of initial data, manual adjustments, mirroring of [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: The algorithms use the 3D scan of the stump as input and predict either a) the adaptations, where red [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Architectures of 3D neural network 7 [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Architectures of Feedforward neural network [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Average adjustments from stump to socket in millimeters. Red indicates reduction, and green indicates [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: The absolute location of the surface-to-surface error for the random forest predicting the adaptations in [PITH_FULL_IMAGE:figures/full_fig_p011_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Distance maps of the surface-to-surface error for best, median, and worst test case. Shown for the [PITH_FULL_IMAGE:figures/full_fig_p012_9.png] view at source ↗
read the original abstract

The quality of a transtibial prosthetic socket depends on the prosthetist's skills and expertise, as the fitting is performed manually. This study investigates multiple artificial intelligence (AI) approaches to help standardize transtibial prosthetic socket design. Data from 118 patients were collected by prosthetists working in the Dutch healthcare system. This data consists of a three-dimensional (3D) scan of the residual limb and a corresponding 3D model of the prosthetist-designed socket. Multiple data pre-processing steps are performed for alignment, standardization and optionally compression using Morphable Models and Principal Component Analysis. Afterward, three different algorithms - a 3D neural network, Feedforward neural network, and random forest - are developed to either predict 1) the final socket shape or 2) the adaptations performed by a prosthetist to predict the socket shape based on the 3D scan of the residual limb. Each algorithm's performance was evaluated by comparing the prosthetist-designed socket with the AI-generated socket, using two metrics in combination with the error location. First, we measure the surface-to-surface distance to assess the overall surface error between the AI-generated socket and the prosthetist-designed socket. Second, distance maps between the AI-generated and prosthetist sockets are utilized to analyze the error's location. For all algorithms, estimating the required adaptations outperformed direct prediction of the final socket shape. The random forest model applied to adaptation prediction yields the lowest error with a median surface-to-surface distance of 1.24 millimeters, a first quartile of 1.03 millimeters, and a third quartile of 1.54 millimeters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript evaluates three AI algorithms (3D neural network, feedforward neural network, and random forest) for predicting transtibial prosthetic socket shapes from 3D residual limb scans using a 118-patient Dutch clinical dataset. It compares direct socket prediction against predicting prosthetist-performed adaptations, reporting that random forest adaptation prediction achieves the lowest error with a median surface-to-surface distance of 1.24 mm (Q1 1.03 mm, Q3 1.54 mm) relative to the prosthetist-designed sockets, and concludes this supports standardization of socket design.

Significance. If the results hold under proper validation, the work provides a concrete baseline for AI-assisted prosthetic design by showing millimeter-scale surface accuracy on real patient data and demonstrating that adaptation prediction outperforms direct shape regression. The multi-model comparison and use of surface distance plus error maps offer practical insights for the field. Significance is tempered by the absence of inter-expert variability benchmarks or outcome linkage, which limits claims about achieving true standardization rather than replicating one expert's style.

major comments (2)
  1. [Results] Results section (performance reporting): The abstract and results report concrete median and quartile surface distances (e.g., 1.24 mm for RF adaptation) but supply no information on train-test splits, cross-validation, hyperparameter search, or statistical testing of differences between models. This is load-bearing for the central claim that RF yields the lowest error.
  2. [Introduction / Methods] Introduction and Methods (ground-truth definition): The claim that the AI approaches advance standardization of socket design rests on treating the single prosthetist-designed socket per patient as the target. No inter-prosthetist agreement statistics, repeated designs on the same limbs, or linkage to fit comfort/functional outcomes are reported. If typical inter-expert surface distances exceed the reported AI error, the numerical result shows replication of a particular design rather than reduction of variability.
minor comments (2)
  1. [Abstract] Abstract: The description of the 118-patient dataset and the two prediction strategies (direct vs. adaptation) could be stated more explicitly to clarify the experimental design before the performance numbers.
  2. [Figures] Figure clarity: Distance maps are mentioned for error location analysis; ensure color scales and anatomical landmarks are labeled consistently across examples to allow readers to interpret regional error patterns.

Simulated Author's Rebuttal

2 responses · 1 unresolved

We thank the referee for the constructive feedback on validation details and the interpretation of our standardization claims. We respond point-by-point to the major comments below.

read point-by-point responses
  1. Referee: [Results] Results section (performance reporting): The abstract and results report concrete median and quartile surface distances (e.g., 1.24 mm for RF adaptation) but supply no information on train-test splits, cross-validation, hyperparameter search, or statistical testing of differences between models. This is load-bearing for the central claim that RF yields the lowest error.

    Authors: We agree that explicit details on the experimental protocol are necessary to support the performance claims. The revised manuscript will add a dedicated 'Model Training and Evaluation' subsection in Methods that reports the train-test partitioning strategy (including any patient-level stratification to avoid leakage), cross-validation procedure if employed, hyperparameter selection method, and statistical comparisons (e.g., Wilcoxon signed-rank tests with correction for multiple comparisons) between the three models on the surface-to-surface distance metric. These additions will directly substantiate the reported superiority of the random forest adaptation model. revision: yes

  2. Referee: [Introduction / Methods] Introduction and Methods (ground-truth definition): The claim that the AI approaches advance standardization of socket design rests on treating the single prosthetist-designed socket per patient as the target. No inter-prosthetist agreement statistics, repeated designs on the same limbs, or linkage to fit comfort/functional outcomes are reported. If typical inter-expert surface distances exceed the reported AI error, the numerical result shows replication of a particular design rather than reduction of variability.

    Authors: We accept the critique on the scope of the standardization claim. Our 118-patient dataset contains one expert-designed socket per residual limb, and the models learn to predict the adaptations performed in that clinical context. In the revision we will (i) rephrase the Introduction and Discussion to state that the work demonstrates accurate replication of prosthetist adaptations rather than reduction of inter-expert variability, and (ii) add an explicit Limitations paragraph noting the absence of multi-prosthetist repeated designs and clinical outcome linkage, while recommending these as directions for future multi-center studies. revision: partial

standing simulated objections not resolved
  • Absence of repeated socket designs by multiple prosthetists on the same limbs prevents computation of inter-expert surface distance statistics from the current dataset.

Circularity Check

0 steps flagged

No significant circularity; standard supervised ML evaluation on external dataset.

full rationale

The paper trains models (3D NN, FFNN, random forest) on 118-patient data to predict either final socket shape or prosthetist adaptations from residual-limb scans, then evaluates surface-to-surface distance against the held-out prosthetist-designed sockets. No equations, ansatzes, or self-citations are shown that reduce the reported median error (1.24 mm) to a fitted parameter or input by construction. Pre-processing (alignment, Morphable Models, PCA) is standard and does not create self-definitional loops. The central claim remains an empirical performance number on independent test cases rather than a tautology.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that surface-to-surface distance is a sufficient proxy for clinical socket quality and that the collected prosthetist designs are representative targets. Standard ML preprocessing (alignment, PCA compression) introduces tunable parameters whose values are not reported.

free parameters (2)
  • PCA compression level
    Number of principal components retained after Morphable Model compression; chosen to balance fidelity and dimensionality.
  • Random forest hyperparameters
    Number of trees, depth, and feature sampling parameters tuned on the training portion of the 118-patient set.
axioms (1)
  • domain assumption Prosthetist-designed sockets represent the desired standardized output
    Invoked when treating the collected sockets as ground truth for both direct prediction and adaptation targets.

pith-pipeline@v0.9.0 · 5869 in / 1281 out tokens · 31993 ms · 2026-05-22T17:43:36.528425+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

26 extracted references · 26 canonical work pages · 1 internal anchor

  1. [1]

    URL: https://www.who.int/publications/i/item/ 9789241512480

    WHO & USAID, Health Product Policy and Standards, Who standards for prosthetics and orthotics, www.who.int (2017). URL: https://www.who.int/publications/i/item/ 9789241512480

  2. [2]

    URL: https://www

    Limbs international, Why limbs (Date accessed: 26-03-2025). URL: https://www. limbsinternational.org/why-limbs.html

  3. [3]

    J. S. Jensen, W. Raab, J. R. Fisk, C. Hartz, A. Saldana, C. Harte, Quality of polypropy- lene sockets for Trans-Tibial prostheses in Low-Income countries, Prosthetics and Orthotics International 30 (2006) 45–59. doi: 10.1080/03093640600568336

  4. [4]

    Safari, M

    R. Safari, M. R. Meier, Systematic review of effects of current transtibial prosthetic socket designs—part 1: Qualitative outcomes, Journal of Rehabilitation Research and Development (2015)

  5. [5]

    C. W. Radcliffe, The patellar-tendon-bearing below-knee prosthesis, Biomechanics Laboratory (1961)

  6. [6]

    T. B. Staats, J. Lundt, The ucla total surface bearing suction below-knee prosthesis, Clinical prosthetics and orthotics 11 (1987)

  7. [7]

    Safari, Lower limb prosthetic interfaces: Clinical and technological advancement and po- tential future direction, Prosthetics and orthotics international 44 (2020) 384–401

    R. Safari, Lower limb prosthetic interfaces: Clinical and technological advancement and po- tential future direction, Prosthetics and orthotics international 44 (2020) 384–401

  8. [8]

    Colombo, G

    G. Colombo, G. Facoetti, C. Rizzi, Automatic below-knee prosthesis socket design: a pre- liminary approach, in: Digital Human Modeling: Applications in Health, Safety, Ergonomics and Risk Management: 7th International Conference, DHM 2016, Held as Part of HCI In- ternational 2016, Toronto, ON, Canada, July 17-22, 2016, Proceedings 7, Springer, 2016, pp. 75–81

  9. [9]

    S. Li, H. Lan, X. Luo, Y. Lv, L. Gao, H. Yu, Quantitative compensation design for prosthetic socket based on eigenvector algorithm method, Review of Scientific Instruments 90 (2019)

  10. [10]

    Dickinson, L

    A. Dickinson, L. Diment, R. Morris, E. Pearson, D. Hannett, J. Steer, Characterising residual limb morphology and prosthetic socket design based on expert clinician practice, Prosthesis 3 (2021) 280–299

  11. [11]

    A. G. Cutti, M. G. Santi, A. H. Hansen, S. Fatone, R. L. S. C. Group, Prosthetist-specific recti- fication templates based on artificial intelligence for the digital fabrication of custom transtibial sockets, Prosthesis 6 (2024) 1149–1169. 15

  12. [12]

    van der Stelt, B

    M. van der Stelt, B. Berends, M. Papenburg, T. Langenhuyzen, T. Maal, L. Brouwers, G. de Jong, R. Leijendekkers, Evaluating the effectiveness of transtibial prosthetic socket shape design using artificial intelligence: A clinical comparison with traditional plaster cast socket designs, Archives of physical medicine and rehabilitation (2024)

  13. [13]

    Sharp, S

    N. Sharp, S. Attaiki, K. Crane, M. Ovsjanikov, Diffusionnet: Discretization agnostic learning on surfaces, ACM Transactions on Graphics (TOG) 41 (2022) 1–16

  14. [14]

    Y. Li, R. Bu, M. Sun, W. Wu, X. Di, B. Chen, Pointcnn: Convolution on x-transformed points, Advances in neural information processing systems 31 (2018)

  15. [15]

    C. R. Qi, L. Yi, H. Su, L. J. Guibas, Pointnet++: Deep hierarchical feature learning on point sets in a metric space, Advances in neural information processing systems 30 (2017)

  16. [16]

    J. D. White, A. Ortega-Castrill´ on, H. Matthews, A. A. Zaidi, O. Ekrami, J. Snyders, Y. Fan, T. Penington, S. Van Dongen, M. D. Shriver, et al., Meshmonk: Open-source large-scale intensive 3d phenotyping, Scientific reports 9 (2019) 6085

  17. [17]

    C. R. Qi, H. Su, K. Mo, L. J. Guibas, Pointnet: Deep learning on point sets for 3d classification and segmentation, in: Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, pp. 652–660

  18. [18]

    van der Stelt, T

    M. van der Stelt, T. J. J. Maal, C. H. E. Jordaan, Transtibial prosthetic socket shape design using artificial intelligence, Radboud Data Repository, 2024. doi: 10.34973/hcq4-9296

  19. [19]

    Dickinson, J

    A. Dickinson, J. Steer, C. Rossides, L. Diment, F. Mbithi, J. Bramley, D. Hannett, J. Blinova, Z. Tankard, P. Worsley, Insights into the spectrum of transtibial prosthetic socket design from expert clinicians and their digital records, Frontiers in Rehabilitation Sciences 5 (2024) 1354069

  20. [20]

    URL: https://www

    Python Software Foundation, Python programming language, 2019. URL: https://www. python.org/

  21. [21]

    Bermejo, K

    E. Bermejo, K. Taniguchi, Y. Ogawa, R. Martos, A. Valsecchi, P. Mesejo, O. Ib´ a˜ nez, K. Imaizumi, Automatic landmark annotation in 3d surface scans of skulls: Methodological proposal and reliability study, Computer Methods and Programs in Biomedicine 210 (2021) 106380

  22. [22]

    PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transforma- tion and Graph Compilation

    J. Ansel, E. Yang, H. He, N. Gimelshein, A. Jain, M. Voznesensky, B. Bao, P. Bell, D. Be- rard, E. Burovski, G. Chauhan, A. Chourdia, W. Constable, A. Desmaison, Z. DeVito, E. El- lison, W. Feng, J. Gong, M. Gschwind, B. Hirsh, S. Huang, K. Kalambarkar, L. Kirsch, M. Lazos, M. Lezcano, Y. Liang, J. Liang, Y. Lu, C. Luk, B. Maher, Y. Pan, C. Puhrsch, M. Re...

  23. [23]

    D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, 2015. arXiv:1412.6980. 16

  24. [24]

    Biewald, Experiment tracking with weights and biases, 2020

    L. Biewald, Experiment tracking with weights and biases, 2020. URL: https://www.wandb. com/, software available from wandb.com

  25. [25]

    Yan, Pointnet/pointnet++ pytorch, 2019

    X. Yan, Pointnet/pointnet++ pytorch, 2019. URL: https://github.com/yanx27/Pointnet_ Pointnet2_pytorch

  26. [26]

    A. S. Dickinson, M. K. Donovan-Hall, S. Kheng, K. Bou, A. Tech, J. W. Steer, C. D. Metcalf, P. R. Worsley, Selecting appropriate 3d scanning technologies for prosthetic socket design and transtibial residual limb shape characterization, JPO: Journal of Prosthetics and Orthotics 34 (2022) 33–43. Appendix A. Pre-processing Figure A.10: An example of a 3D sc...