| Issue |
MATEC Web Conf.
Volume 414, 2025
9th Scientific and Technical Days in Mechanics and Materials: Innovative Materials and Processes for Industrial and Biomedical Applications (JSTMM 2024)
|
|
|---|---|---|
| Article Number | 04007 | |
| Number of page(s) | 9 | |
| Section | Mechanical Design, Modeling & Manufacturing Processes | |
| DOI | https://doi.org/10.1051/matecconf/202541404007 | |
| Published online | 02 October 2025 | |
Automated 3D Reconstruction of Mechanical Components Using Multimodal Deep Learning
Mechanical, Production and Energy Laboratory (LMPE), National School of Engineering of Tunis (ENSIT), University of Tunis, 1008, Avenue Taha Hussein, Montfleury, Tunisia
* e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
** e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
*** e-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
Abstract
This paper presents a multimodal deep learning approach for precise 3D reconstruction of mechanical parts from images and textual descriptions, offering a cost-effective alternative to traditional methods. By combining high-resolution, multi-angle images with technical text data, the model generates accurate 3D representations. A ResNet-based CNN extracts visual features, while BERT encodes textual descriptions, with a depth estimation module enhancing spatial accuracy. The features are fused to produce a 3D point cloud and mesh. The results demonstrate good performance in capturing the overall shape of the mechanical parts; however, further improvements are needed to enhance the precision of the metric parameters.
Key words: 3D reconstruction / multimodal deep learning / computer vision / mechanical modeling / depth estimation
© The Authors, published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.

