面向自动驾驶的多模态信息融合动态目标识别

doi:10.11835/j.issn.1000.582X.2024.04.012

首页 > 过刊浏览>2024年第47卷第4期 >139-156. DOI:10.11835/j.issn.1000.582X.2024.04.012

面向自动驾驶的多模态信息融合动态目标识别
DOI:
                        10.11835/j.issn.1000.582X.2024.04.012
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者单位:1.广东轻工职业技术学院 汽车技术学院，广州 510000;2.广汽埃安新能源汽车股份有限公司研发中心，广州 511400;3.华南理工大学 机械与汽车工程学院，广州 510641;4.广州城市理工学院 工程研究院，广州 510800
作者简介:张明容（1983—），女，博士，副教授，主要从事智能网联汽车方向研究,（E-mail）153155269@qq.com。
通讯作者:喻皓，男，高级工程师，（E-mail）yuhao@gacne.com.cn。
中图分类号:
基金项目:国家自然科学基金资助项目（51975217）。

Multimodal information fusion dynamic target recognition for autonomous driving

Author:

Affiliation:

1.School of Automotive Technology, Guangdong Industry Polytechnic, Guangzhou 510000,P. R. China;2.GAC AION New Energy Automobile Co., Ltd., Guangzhou 511400, P. R. China;3.School of Mechanical & Automotive Engineering, South China University of Technology,Guangzhou 510641, P. R. China;4.Engineering Research Institute, Guangzhou City;University of Technology, Guangzhou 510800, P. R. China

Fund Project:

Supported by National Natural Science Foundation of China（51975217）.

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

研究提出一种面向自动驾驶的多模态信息融合的目标识别方法，旨在解决自动驾驶环境下车辆和行人检测问题。该方法首先对ResNet50网络进行改进，引入基于空间注意力机制和混合空洞卷积，通过选择核卷积替换部分卷积层，使网络能够根据特征尺寸动态调整感受野的大小；然后，卷积层中使用锯齿状混合空洞卷积，捕获多尺度上下文信息，提高网络特征提取能力。改用GIoU损失函数替代YOLOv3中的定位损失函数，GIoU损失函数在实际应用中具有较好操作性；最后，提出了基于数据融合的人车目标分类识别算法，有效提高目标检测的准确率。实验结果表明，该方法与OFTNet 、VoxelNet 和FasterRCNN网络相比，在mAP指标白天提升幅度最高可达0.05，晚上可达0.09，收敛效果好。

Abstract:

A multi-modal information fusion based object recognition method for autonomous driving is proposed to address the vehicle and pedestrian detection challenge in autonomous driving environments. The method first improves ResNet50 network based on spatial attention mechanism and hybrid null convolution. The standard convolution is replaced by selective kernel convolution, which allows the network to dynamically adjust the size of the perceptual field according to the feature size. Then, the sawtooth hybrid null convolution is used to enable the network to capture multi-scale contextual information and improve the network feature extraction capability. The localization loss function in YOLOv3 is replaced with the GIoU loss function, which has better operability in practical applications. Finally, human-vehicle target classification and recognition algorithm based on two kinds of data fusion is proposed, which can improve the accuracy of the target detection. Experimental results show that compared with OFTNet, VoxelNet and FASTERRCNN, the mAP index can be improved by 0.05 during daytime and 0.09 in the evening, and the convergence effect is good.

参考文献

相似文献

引证文献

引用本文

张明容,喻皓,吕辉,姜立标,李利平,卢磊.面向自动驾驶的多模态信息融合动态目标识别[J].重庆大学学报,2024,47(4):139-156.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2023-05-12
最后修改日期:
录用日期:
在线发布日期: 2024-05-06
出版日期:

期刊社主页

编辑部首页

期刊介绍

编委会

数据库收录

过刊浏览

联系我们

引用本文

分享

文章指标

历史

文章二维码