论文标题
使用多波长衍射深神经网络的光学多任务学习
Optical multi-task learning using multi-wavelength diffractive deep neural networks
论文作者
论文摘要
光子神经网络是使用光子而不是电子来执行人工智能(AI)任务的脑启发的信息处理技术。但是,现有的体系结构是为单个任务而设计的,但由于任务竞赛恶化了模型性能,因此在单个整体系统中未能在单个整体系统中并行多重任务。本文通过使用关节优化方法设计多波长衍射深神经网络(D2NN),提出了一种新型的光学多任务学习系统。通过将多任务输入编码到多波长通道中,系统可以增加计算吞吐量,并显着避免竞争,以高精度并行执行多个任务。我们分别设计了具有两个和四个频谱通道的两任任务和四任任务D2NN,用于对MNIST,FMNIST,KMNIST和EMNIST数据库的不同输入进行分类。数值评估表明,在相同的网络大小下,MUL-TI波长D2NN与单波长D2NN相比,多任务学习的分类精度明显更高。此外,通过增加网络大小,用于执行多个任务的多波长D2NN达到了可比较的分类精度,相对于多个单波长D2NN的个别训练,可以分别执行任务。我们的工作为开发波长划分多路复用技术的方式铺平了道路,以实现高通量神经形态光子计算和更通用的AI系统,以并行执行多个任务。
Photonic neural networks are brain-inspired information processing technology using photons instead of electrons to perform artificial intelligence (AI) tasks. However, existing architectures are designed for a single task but fail to multiplex different tasks in parallel within a single monolithic system due to the task competition that deteriorates the model performance. This paper proposes a novel optical multi-task learning system by designing multi-wavelength diffractive deep neural networks (D2NNs) with the joint optimization method. By encoding multi-task inputs into multi-wavelength channels, the system can increase the computing throughput and significantly alle-viate the competition to perform multiple tasks in parallel with high accuracy. We design the two-task and four-task D2NNs with two and four spectral channels, respectively, for classifying different inputs from MNIST, FMNIST, KMNIST, and EMNIST databases. The numerical evaluations demonstrate that, under the same network size, mul-ti-wavelength D2NNs achieve significantly higher classification accuracies for multi-task learning than single-wavelength D2NNs. Furthermore, by increasing the network size, the multi-wavelength D2NNs for simultaneously performing multiple tasks achieve comparable classification accuracies with respect to the individual training of multiple single-wavelength D2NNs to perform tasks separately. Our work paves the way for developing the wave-length-division multiplexing technology to achieve high-throughput neuromorphic photonic computing and more general AI systems to perform multiple tasks in parallel.