论文标题
集成光子FFT,用于用于高效和高速神经网络的光学卷积
Integrated Photonic FFT for Optical Convolutions towards Efficient and High-Speed Neural Networks
论文作者
论文摘要
从深度学习系统中执行的数据提取的技术与技术相关的任务通常是在电流特定的域特异性体系结构中以电子方式重复的快速傅立叶变换(FFT)来完成的,例如图形处理单元(GPU)。但是,由于与互连电容相关的电线充电挑战,电子系统在功率耗散和延迟方面受到限制。在这里,我们为卷积神经网络提供了基于硅光子学的体系结构,该体系结构利用光的相位特性通过在傅立叶域中执行卷积作为乘法来有效地执行FFT。算法执行时间取决于信号的飞行时间通过此光子可重构的无源FFT滤波器电路,并且在picsecond的10s阶。灵敏度分析表明,该光学处理器必须热相稳定,对应于几个度。此外,我们发现,对于较小的样本号,每项{时芯片面积)的可获得数量比GPU的大约2个数量级优于GPU。最后,我们表明,从概念上讲,光学FFT和卷积加工性能确实与光电设备级直接相关,并且在等离子体,超材料或纳米光子学的改进中,它促进了下一代与边缘互联的智能光子电路与边缘相关的5G 5G网络的相关性。
The technologically-relevant task of feature extraction from data performed in deep-learning systems is routinely accomplished as repeated fast Fourier transforms (FFT) electronically in prevalent domain-specific architectures such as in graphics processing units (GPUs). However, electronics systems are limited with respect to power dissipation and delay, both, due to wire-charging challenges related to interconnect capacitance. Here we present a silicon photonics-based architecture for convolutional neural networks that harnesses the phase property of light to perform FFTs efficiently by executing the convolution as a multiplication in the Fourier-domain. The algorithmic executing time is determined by the time-of-flight of the signal through this photonic reconfigurable passive FFT filter circuit and is on the order of 10s of picosecond. A sensitivity analysis shows that this optical processor must be thermally phase stabilized corresponding to a few degrees. Furthermore, we find that for a small sample number, the obtainable number of convolutions per {time-power-chip area) outperforms GPUs by about 2 orders of magnitude. Lastly, we show that, conceptually, the optical FFT and convolution-processing performance is indeed directly linked to optoelectronic device-level, and improvements in plasmonics, metamaterials or nanophotonics are fueling next generation densely interconnected intelligent photonic circuits with relevance for edge-computing 5G networks.