Abstract
With the exponential growth of artificial intelligence-driven data centre traffic, next-generation data centre optical interconnects must deliver high-speed data transmission while ensuring low latency and power consumption. Here, we present an ultra-simple low-latency self-homodyne coherent interconnect solution through anti-resonant hollow core fibre and leverages the Fermat number transform to implement the entire digital signal processing. The Fermat number transform eliminates the round-off errors prevalent in the fast Fourier transform through modulo operations and replaces computationally intensive multiplications with simple cyclic shift and addition operations. As a proof of concept, we demonstrate bidirectional transmission through a 5.1-km anti-resonant hollow core fibre, achieving a data rate of 448 Gb·s-1. Our proposed scheme reduces complexity of digital signal processing by 90%, whereas the integration of the anti-resonant hollow core fibre reduces the propagation latency by 28.4%. This work establishes a promising path to push the energy-efficiency boundary of coherent structure and enables large scale deployment of coherent optical interconnects.
Similar content being viewed by others
Introduction
Data centres have emerged as essential infrastructures, driven by the explosive growth of Internet of vehicles, cloud services and artificial intelligence-generated content applications based on large language models in recent years1,2. These applications have led to a large amount of data traffic on data centre networks (DCNs); intra-data centre traffic typically spanning <2 km accounts for more than 70% of the total internet traffic. To meet the escalating demands for computing power (Fig. 1a)3,4, local data centre interconnects (DCIs) are needed to connect to nearby edge data centres within 10 km, forming a super data centre. These developments impose stringent trade-offs and challenges on next-generation DCIs in terms of speed, power consumption, and latency. Emerging high-capacity DC optical interconnects within a speed of 1.6 Tb·s−1/3.2·Tb s−1 and a distance range of 500 m to 10 km will soon become the mainstream solution to support large-scale data interactions5. However, these large-scale, latency-sensitive applications further intensify the requirements for the power consumption and ultra-low link latency of DCIs6. Moreover, the fibre and spectrum resources involved in DCs are limited, necessitating spectrally efficient bidirectional (BiDi) transmission at a single wavelength.
This scheme is designed to address the power consumption and latency challenges faced by conventional coherent technologies in next-generation intra-/inter data centre networks. a Intra-/inter data centre optical networks undertake large amounts of data traffic, supporting the internet of vehicles, cloud services, and AIGC training and application processes. b Architecture of conventional single-carrier coherent technology in DCNs. c Power consumption distribution of the coherent optical module. Others mainly involve power conversion loss. d Link latency distribution of the conventional coherent optical module using SSMF. Frame time: the time used to transmit and receive a MAC frame; Switch: Ethernet switches. e The proposed FNT-based transmitter (Tx) DSP strategy. f The proposed FNT-based receiver (Rx) DSP strategy. The SHCD structure eliminates FOC and CPR algorithms. The use of NANF and DSCM technologies eliminates the CDC block and unlocks the FNT-based AEQ algorithm. By integrating FNT into Nyquist filtering and AEQ blocks, the ultra-simple DSP flow is achieved. g The FNT-SHCD architecture for bidirectional transmission: a bias point offset is adopted to insert the LO component into the guard interval of the DSCM signal, and FBG and OIL are used to achieve LO regeneration. h The internal microstructure of the NANF with five annular glass tubes. FNT: Fermat number transform; AIGC: artificial intelligence-generated content; DCN: data centre network; DP IQ Mod.: dual polarization in-phase/quadrature modulator; SSMF standard single-mode fibre, LO local oscillator, ICR integrated coherent receiver, DSP digital signal processing, FEC forward error correction, TIA trans-impedance amplifier, ADC analogue-to-digital converter, DAC digital-to-analogue converter, MAC media access control, SHCD self-homodyne coherent detection, FOC frequency offset compensation, CPR carrier phase recovery, NANF nested anti-resonant nodeless fibre, DSCM digital subcarrier multiplexing, CDC chromatic dispersion compensation, AEQ adaptive equalization, PRBS pseudo-random binary sequence, SRRCF square root raised cosine filtering, SRRCMF square root raised cosine matched filtering, DSC Mux/Demux digital signal subcarrier multiplexing/demultiplexing, FBG fibre Bragg grating, OIL optical injection locking, DFB distributed feedback.
Owing to its relatively high receiver sensitivity and spectral efficiency (SE), as well as its ability to compensate for various types of damage, the conventional coherent structure (Fig. 1b) exhibits improved tolerance for fibre propagation impacts7,8,9, making it a promising solution for future DCIs. However, the high latency associated with conventional single-mode fibre (SSMF) and power-hungry complex digital signal processing (DSP) technology remains unacceptable for cost and time-sensitive data centre optical interconnects, hindering the deployment of coherent modules in real short-reach applications. The power consumption and link latency distributions of the coherent optical module are shown in Fig. 1c and d, where the DSP alone consumes half of the power consumption and the propagation time consumes nearly half of the link latency10,11,12. Therefore, innovations in converged transmission architectures, advanced DSP techniques, and fibre hardware are imperative to mitigate the power consumption and latency challenges of traditional coherent technology.
Recently, several BiDi self-homodyne coherent detection (SHCD) transmission schemes have been proposed to reduce the power consumption levels and cost6,13,14,15,16. By transmitting the LO along with the signal, the requirement for high-cost narrow-linewidth lasers is relaxed. However, these schemes require additional fibres or polarization dimensions to transmit the LO, and the wavelengths of the two BiDi transmission directions need to be staggered, which wastes valuable fibre resources and reduces SE. Moreover, these conventional SSMF-based architectures neglect link latency optimization, leading to suboptimal performance in latency-sensitive applications.
For DSP simplification, SHCD systems eliminate the need for carrier recovery, including frequency offset compensation (FOC) and carrier phase recovery (CPR), leaving chromatic dispersion compensation (CDC), adaptive equalization (AEQ), and Nyquist filtering (NF) as the dominant power consumed by DSP due to their convolutional structures. The conventional simplified DSP approach is mainly based on optimizing the structures of algorithms17,18. Although these methods have been proven effective, they have now reached a point where further progress is difficult to achieve. The Fermat number transform (FNT) has attracted considerable attention, owing to its ability to achieve fixed-point convolution without any round-off error via modulus constraint and reduce hardware complexity via circular shift operations19,20,21. Therefore, the FNT provides an efficient path for the implementation of DSP in application-specific integrated circuits (ASICs) and field-programmable gate arrays (FPGAs), greatly reducing the power consumption level. Despite these advantages, the inherent limitations in bit-width and transform length currently hinder the direct deployment of FNT in conventional coherent DSP systems, making innovations in transmission architecture necessary for practical implementation of the FNT-based DSP strategy. The two-dimensional FNT (2D-FNT), which extends the transformation length, is widely used to mitigate the inter-symbol interference (ISI) introduced by CD, especially for single-carrier modulations with speeds above 800 Gb s−1 in standard single-mode fibre (SSMF), where an extended transformation length is needed. However, compared with FNT, the 2D-FNT introduces additional hardware complexity overhead and overflow risk, resulting in critical and irreversible computational errors.
In this work, we demonstrate an ultra-simple DSP entirely based on FNT through a simplified and low-latency BiDi transmission coherent structure at a single wavelength, empowered by the adoption of nested anti-resonant nodeless fibre (NANF) and SHCD structure. NANF, as a type of anti-resonant hollow core fibre (AR-HCF), relies on a fundamentally different light guiding mechanism that simultaneously eliminates CD and the refractive index limitations of SSMF22,23,24, and has become a breakthrough solution to simplify DSP and reduce propagation latency25. In addition, NANF exhibits negligible nonlinearity and Rayleigh scattering compared with solid-core SSMF26,27,28, enabling high launch power BiDi transmission at the same wavelength in a single fibre. To form a spectrally efficient SHCD structure, we adopt a simple bias offset method called residual carrier modulation (RCM)29 to generate a direct-current frequency-domain pilot tone (DC-FPT). The signal is distributed in the form of a digital subcarrier multiplexing (DSCM) signal on both sides of the DC-FPT to provide a sufficient guard interval (GI) to prevent interference between the DC-FPT and the signal. The receiver separates a small amount of the signal and deploys a fibre Bragg grating (FBG) filter to obtain pure DC-FPT components. Therefore, by utilizing a cost-effective distributed feedback (DFB) laser for optical injection locking (OIL), a high-quality LO is regenerated, which is homologous to the signal. By leveraging the aforementioned spectrally efficient SHCD architecture and advanced hollow-core fibre technology, we achieve, a DSP implementation entirely based on the hardware-efficient FNT (Fig. 1e and f). Therefore, we designate the holistic co-design of FNT-DSP, SHCD architecture, and AR-HCF as FNT-SHCD. In addition to the necessary forward error correction (FEC) block, the demonstrated design achieves a 90% reduction in the computational complexity of DSP without compromising transmission performance. Furthermore, experimental validation via bidirectional transmission over a single wavelength demonstrates a 28.4% decrease in the end-to-end latency and a 1.27-dB improvement in the receiver sensitivity. This work establishes a benchmark for next-generation high-speed, low-power, and low-latency optical interconnects in the age of artificial intelligence.
Results
FNT-SHCD architecture
The core of the FNT-SHCD structure lies in the slight bias offset of the modulator in the transmitter and the OIL process of the receiver (Fig. 1g). By using DSCM technology, a DC-FPT is inserted between signals located within a certain protection interval. The DC-FPT generation process relies mainly on slightly deviating the bias voltage of the modulator from the null point (Pnull). Since the DC-FPT is used for LO regeneration in this work, the carrier-to-signal power ratio (CSPR) is used in this article instead of the pilot-to-signal power ratio (PSR) for characterizing the magnitude relationship between the DC-FPT and the signal power.
Therefore, the DC-FPT and the DSCM signal, which originate from the same laser and propagate along the same optical path, are generated and transmitted to the receiver simultaneously. By deploying OIL, a narrow-linewidth LO can be regenerated. The regenerated LO and signal reach the receiver simultaneously by adjusting the variable optical delay line (VODL). Thus, we achieve SHCD, which eliminates the influences of frequency offsets in the signal and simplifies or even removes the carrier phase recovery algorithm in the DSP. In FNT-SHCD, the signal is located on both sides of the LO in the form of DSCM, which can fully utilize the spectrum resources. Conventionally, spectral separation between the residual optical carrier and signal components induces chromatic-dispersion-driven propagation delay mismatch in higher-order subcarriers, thereby increasing their phase noise. However, the combined effect of the narrow-linewidth characteristic of the regenerated-LO and the low-chromatic-dispersion characteristic of AR-HCF renders the performance degradation negligible (see Supplementary Discussion 1 and Supplementary Fig. 1). Compared with polarization division multiplexing SHCD (PDM-SHCD)13 and spatial division multiplexing SHCD (SDM-SHCD)30, it has a relatively high SE and does not require additional fibres. Compared with the conventional PDM-SHCD scheme, our proposed FNT-SHCD (with a bandwidth of 60 GHz and a GI of 4 GHz) achieves a higher SE of 86% under the dual polarization 16-quadrature amplitude modulation (16-QAM) format, which is equivalent to that of SDM-SHCD when a 15-core multi-core fibre is used.
NANF enables low-latency BiDi transmission at a single wavelength
In SSMF, owing to the influence of backscattering, BiDi transmission often operates at different wavelengths. NANF has been proven to have low Rayleigh scattering and nonlinearity26,27,31. Therefore, NANF can achieve BiDi transmission at a single wavelength without the requirement of an additional complex wavelength scheduling design.
The internal microstructure of the NANF, which has five annular glass tubes, forms a double-layer nested structure (Fig. 1h). By using a reverse-tapering method and optimizing all splicing parameters, a low splicing loss is achieved between the NANF and SSMF. The sum of the splicing loss and 5.1-km fibre transmission loss is 4.1 dB.
Owing to the use of NANF, the latency of propagation is greatly reduced32. Through optical time-domain reflectometer (OTDR) technology, we obtain the temporal traces of the NANF and SSMF to compare the propagation latency. We compress the time axis to half of the real-time duration to reveal the actual propagation latency. To increase the signal-to-noise ratio (SNR), both traces are averaged 64 times, enabling accurate identification of the peaks corresponding to the front and rear ends of the fibres, as depicted in Fig. 2. The Rayleigh backscattering (RBS) in the NANF is relatively low, so here, we only measure the reflection signal from the end face to determine the propagation latency. By integrating NANF into optical interconnects, a lower RBS signal and a 28.4% lower propagation latency than those of SSMF are achieved.
FNT-based DSP strategy
The number theoretic transform (NTT), as an advanced DSP technique, can achieve convolution results without any round-off error by utilizing the modulo operation. The NTT is defined on the finite integer field, which is realized by the modulo operation on the calculation results. The above transformation has cyclic convolution properties to replace the convolution characteristics of the FFT (Fig. 3a) in the system, if the modulus F and radix α satisfy the specific expression33 (see the “Methods” section for the selection of the modulus F and radix α). When the radix of the transform α = 2m and m is a positive integer, all complex multiplication operations in the NTT can be replaced with circular shifts that are more efficient for computers or application-specific integrated circuits (ASICs). In addition, the modulo operation implemented under Fermat numbers can be achieved through addition. The NTT with the aforementioned characteristics is referred to as the Fermat number transform (FNT), which can be implemented via only addition and cyclic shift operations, eliminating the need for explicit multiplication. Notably, multiplications can be replaced by circular shifts and addition operations when \(\sqrt{2}\) is used as the radix of the FNT. During the transformation process, all the data values are confined by using modulo operations; thus, the FNT eliminates the round-off errors that are prevalent in the FFT.
a The radix of the FNT is circularly distributed in the finite integer domain, due to the limitation of the modulo operation. Where \(F\) represents the modulus value and \(\alpha\) represents the radix of the FNT. b The radix of the discrete Fourier transform in the domain of complex-valued number. c The FNDE architecture is based on the FNT. The real part and imaginary part of the complex-valued sequence are separated and multiplied by the filter taps in the Fermat number domain to implement convolution. d The structure of FNT-based adaptive equalization (AEQ). FNT Fermat number transform, DSP digital signal processing, FNDE Fermat number domain equalization.
Therefore, a low-complexity FNT with cyclic convolution properties is defined over a finite integer field (Fig. 3b). In this field, cyclic convolutions are implemented efficiently via hardware-friendly cyclic shift operations without any round-off errors.
However, the FNT faces great challenges in the implementation of conventional DSP in the coherent optical communication structure. Owing to the limitations of the finite integer field, overflow occurs once the convolution result exceeds the modulus Fq33. The mutual limitation between the bit-width and length of the transform must be considered. The relationship between N, the signal sequence x(n) and the tap sequence h(n) is given as follows19:
where bwh and bwx are the bit-widths of the tap and signal sequences. A low bit-width introduces severe quantization errors. Therefore, it is necessary to determine the required bit-width bw and transform length N before integrating the FNT into the algorithms.
The transform length N is also constrained by inequality (1), making it difficult for FNT to support a coherent DSP strategy. With respect to the modulus values F5 and F6 (Fn represents \({2}^{{2}^{n}}+1\)), the bit-width in the FNT is too large for DSP (F5 and F6 require bit-widths of 32 and 64, respectively), whereas F4 cannot support convolutions with long tap lengths. Therefore, for convolutions with lengths of 128 or longer, the 2D-FNT with modulus F4 is often used. Our previous work19,20 achieved a 256-length convolution via 2D-FNT. Although it provides benefits, the 2D-FNT is still more complex than the FNT. Moreover, owing to the increased convolution length of the 2D-FNT, overflow is more likely to occur, leading to irreversible computational errors.
We implement an ultra-low complexity FNT-based transmitter (Tx) DSP and receiver (Rx) DSP strategies (Fig. 1e and f) through the FNT-SHCD structure. In the conventional DSP strategy, NF (square root raised cosine (SRRC) filtering) is widely used to achieve transmission without ISI, reduce the spectral width of the signal and simultaneously suppress out-of-band noise. Subsequent to the NF block, a static long-tap equalizer is used for the CDC, and then a short-tap AEQ is used for polarization demultiplexing, compensating for the ISI and residual CD. The implementation of the conventional NF, CDC and AEQ blocks relies on time-domain equalization (TDE) based on the finite impulse response (FIR) and frequency-domain equalization (FDE) filters based on the FFT34. Although FDE has a lower complexity than TDE for convolutions with long tap lengths, the complexity is still relatively high. Here, we integrate the FNT into both the Tx and Rx DSP strategies by utilizing the structure of Fermat number domain equalization (FNDE) (Fig. 3c). We use an overlap structure to truncate long sequences into short sequences for NF and AEQ.
Notably, the signal is divided into multiple subcarriers in our proposed scheme, thus reducing the baud rate compared with that of the single-carrier scheme. By combining the above method with the use of NANF, the ISI caused by CD is greatly suppressed. Therefore, the AEQ block can deploy the FNT-based AEQ structure19 without the CDC (Fig. 3d). Moreover, owing to the use of OIL to perfectly regenerate LO at the receiver, FOC and CPR are eliminated after the LO is aligned with the time delay of the signal path. Therefore, the introduction of the various technologies mentioned above unlocks ultra-simple DSP flow.
Numerical simulation of the FNT DSP strategy
A numerical simulation is conducted to determine the optimal bit-width for SRRC filtering (the detailed simulation setup is shown in Supplementary Note 1, Supplementary Fig. 2 and Supplementary Table 1). We use the coherent optical communication structure in VPI (VPIphotonics Design Suite v9.9) to transmit 224 (two subcarriers, 2 × 112)-gigabauds (GBaud) DSCM dual-polarization 16-QAM signals for numerical simulation. The fibre length is set to 10 km, the CD coefficient is set to be 4 ps nm−1 km−1, matched with NANF. We use the concatenated staircase forward error code (C-FEC) with a bit error rate (BER) threshold of 1.21 × 10−2 (14.8% overhead35), which employs a good trade-off between power consumption and the correction threshold.
For conducting SRRC filtering during Tx DSP, the modulation format of the signal is standard 16-QAM, so the bit-width of the signal is 3. We test the variation exhibited by the SNR with different bit-widths of taps, as shown in Fig. 4a, and the optimal tap bit-width is 10. For the SRRC matched filtering in the Rx DSP, we also test the influences of different tap and signal bit-widths on the SNR. The optimal tap and signal bit-widths are 7 and 8, respectively, as shown in Fig. 4b. The use of a small bit-width will generate quantization noise, leading to a decrease in the SNR. Overflow occurs when a bit-width that is too large is used, resulting in a sharp decrease in the SNR. This phenomenon is consistent with the previous conclusion. We employ multi-subcarrier multiplexing (e.g., 4/8 subcarriers) to achieve higher transmission rates. Crucially, the optimal bit-width distribution identified through the aforementioned point-by-point scanning strategy remains applicable in multi-subcarrier scenarios (see Supplementary Note 2 and Supplementary Fig. 3).
a The variation exhibited by the SNR of the system with the bit-widths of the taps in the NF algorithm at the transmitter. b The variation exhibited by the SNR of the system with the bit-widths of the signal (bwx) and taps (bwh) in the NF block at the receiver. c BERs obtained for different OSNRs via multiple DSP schemes in the numerical simulation. Through numerical simulation, it is proven that the performances of the three digital signal processing schemes are identical under 16-bit fixed-point computation. SNR signal-to-noise ratio, NF Nyquist filtering, BER bit error rate, OSNR optical signal-to-noise ratio, C-FEC concatenated staircase forward error code.
Through numerical simulation, we validate the performance of the proposed FNT-based DSP strategy. All DSP schemes use 16-bit fixed-point computation. Figure 4c indicates that the performances of the FNT, FFT and FIR schemes are almost identical.
Experimental demonstration of BiDi transmission
To validate the proposed scheme, we conduct BiDi transmission over a 5.1-km NANF using the experimental setup depicted in Fig. 5a. As a proof of concept of BiDi transmission at a single wavelength, we set the wavelength at both transmitter 1 and transmitter 2 to 1550.01 nm.
a The detailed experimental setup used to verify the performance of the system. b BERs obtained for different ROPs under multiple CSPRs. The optimal CSPR of the system with an FBG is smaller than that without the FBG. c Spectrum of the OIL input signal and the regenerated LO without the FBG. d Spectrum of the OIL input signal and the regenerated LO with the FBG. This finding indicates that the use of the FBG reduces the residual signal component of the regenerated LO. ECL external cavity laser, OC optical coupler, CDM coherent driver modulator, DP-IQM dual-polarization in-phase/quadrature modulator, PR polarization rotator, AWG arbitrary waveform generator, EA electrical amplifier, EDFA erbium doped fibre amplifier, BPF band-pass filter, VOA variable optical attenuator, SSMF standard single-mode fibre, NANF nested anti-resonant nodeless fibre, PC polarization controller, VODL variable optical delay line, PBS polarization beam splitter, PM-CIR polarization-maintaining circulator, ICR integrated coherent receiver, DSO digital storage oscilloscope, BER bit error rate, ROP received optical power, CSPR carrier-to-signal power ratio, FBG fibre Bragg grating, OIL optical injection locking, LO local oscillator.
To reduce the residual signal component after conducting OIL, it is necessary to lower the injection power of the OIL to reduce the locking range. However, after the injection power is reduced to approximately −40 dBm, the locked state can be easily lost within minutes. The injection power is controlled simultaneously by the ROP and the CSPR. A high CSPR (>3 dB) results in a large bias offset for the modulator, causing the signal voltage to enter the nonlinear modulation region, introducing device nonlinearity and affecting the quality of the signal. Moreover, a high CSPR indicates elevated LO power levels, which induce a corresponding reduction in the signal power. A low CSPR (less than −6 dB) makes it difficult for OIL to maintain a locked state for a long period under a low ROP. Therefore, we use an FBG to reduce the demand for the CSPR to ensure high-performance transmission.
We obtain the ROP-BER curves at different CSPRs under 56 (two subcarriers, 2 × 28)-GBaud DP-16-QAM, and the results are shown in Fig. 5b. The typical value of CSPR for the proposed scheme is −3 dB. We also test the ROP-BER curve when no FBG filtering is performed in the receiver. According to the results in Fig. 5b, the residual signal component penalizes the receiver sensitivity of 2.03 dB even when we reduce the OIL injection power to enhance the ability to suppress the residual signal. When no FBG filtering is implemented before performing OIL, the increase in the CSPR leads to an increase in the degree of residual signal suppression performed by OIL (the locking range of OIL is shown in the “Methods” section). Therefore, compared with OIL after conducting FBG filtering, the optimal CSPR increases (the optimal CSPR is ~4 dB). Figure 5c and d show the spectra of regenerated LO after OIL with and without an FBG, indicating that the suppression rate of the residual signal increases by more than 16 dB, which can be considered a perfect LO regeneration process.
To test the performance of unidirectional (UniDi) and BiDi transmission within the NANF and SSMF in the system, we verify the impacts of the launch power and opposite-direction launch power on the performance of the 5.1-km NANF and 5-km SSMF, obtaining the optimal injection power for different fibres. The low nonlinearity of the NANF can be demonstrated by Fig. 6a. Even with the optimal launch power, a high launch power in the opposite direction still leads to a transmission performance penalty, as shown in Fig. 6b. The spectra produced when the launch power and opposite launch power are set to 6 dBm and 16 dBm are shown in Fig. 6c and d, respectively. In the SSMF, the LO in the opposite direction with high intensity leads to backwards simulated Brillouin scattering, which results in a concavity in the subcarrier spectrum at shorter wavelengths and a protrusion in the subcarrier spectrum at longer wavelengths.
a BER of the system versus the launch power for different fibres. b BER of the system versus the launch power in the opposite direction through different fibres. The penalty imposed on the BiDi transmission performance of SSMF increases when the launch power exceeds 10 dBm in two directions, whereas the transmission performance in the NANF is always consistent. c Spectrum of the signal through the NANF in BiDi transmission scenarios. d Spectrum of the signal passing through the SSMF in the BiDi transmission scenarios. The performance degradation exhibited by the BiDi transmission scheme used in the SSMF is caused by a power transfer with a frequency interval of 11 GHz, which is attributed to backward simulated Brillouin scattering. e BER versus ROP for the NANF and SSMF in the BiDi and UniDi transmissions. f BER versus ROP for NANF and SSMF at different baud rates. BiDi bidirectional, SSMF standard single-mode fibre, NANF nested anti-resonant nodeless fibre, BER bit error rate; OBTB optical back-to-back transmission, UniDi unidirectional.
Therefore, when an SSMF is used for BiDi transmission, it is necessary to strictly control the launch power and the opposite launch power. In contrast, the NANF results in much lower fibre nonlinearity, allowing for higher-power BiDi transmission processes, which provide advantages for higher-order modulation formats. Under the optimal launch power (NANF: 22 dBm, SSMF: 6 dBm) and a CSPR of −4 dB, we obtain the optimal ROP-BER curves for the SSMF and NANF in the UniDi and BiDi transmission cases, respectively. In the NANF, the UniDi and BiDi transmission performances are identical, whereas implementing BiDi transmission in the SSMF results in a receiver sensitivity penalty of 0.52 dB at the BER threshold of 1.21 × 10−3, as shown in Fig. 6e. The NANF improves the receiver sensitivity of BiDi transmission by 1.27 dB over that attained with the SSMF. A comparison between the performances of the two transmitters is shown in Fig. 6f. The receiver sensitivity of the system is 1.14 dB lower at 56 GBaud than at 38 GBaud.
Discussion
The flexibility and scalability of the proposed scheme
The solution we proposed has the ability to achieve various transmission rates and distances. In hollow-core-fibre channels, residual chromatic dispersion (~4 ps/nm/km) becomes the main source of damage. Therefore, for different transmission scenarios, the ISI introduced by chromatic dispersion is considered.
ISI causes an increase in the overlap length of CDC and AEQ, which can be expressed as36:
where fmax is the maximum frequency of the received signal and Fs is the sampling rate of the signal. c, L, D, and fc are the speed of light, fibre length, CD coefficient and frequency of the optical wave, respectively. Ne is the number of sampling points affected by inter-symbol interference. Since Fs is directly proportional to the signal baud rate, the minimum overlap length is proportional to the square of the signal baud rate and the fibre length. As DSCM technology reduces the baud rate of the produced signal, the ISI caused by CD is greatly suppressed. As a verification of the concept, both the signals in the numerical simulation and the experiment conducted in this article are transmitted in the form of two subcarriers.
The tap length of the AEQ is 32, and the overlap length is 16. For 1.6-Tb s−1 long-range (LR, 2–10 km) scenarios, deploying two subcarriers in the 16-QAM modulation format can achieve AEQ using FNT. Four subcarriers with rates of 112 GBaud are needed for future 3.2-Tb s−1 transmission. For a 1.6-Tb s−1 Metro DCI scenario with a typical transmission distance of 80–120 km, the signal can be divided into 8 subcarriers with 28-GBaud rate subcarriers for transmission. Therefore, although our experiment was conducted at a data rate of 400 Gb s−1 due to the limitations of the experimental conditions, the experimental results still validate the effectiveness of the simplified coherent structure and DSP strategy we proposed in the data centre optical interconnects with higher transmission capacity in the future.
Our work offers a groundbreaking approach by cross-domain collaboration of transmission architecture, fibre hardware, and algorithms, fundamentally reshaping the foundation of mainstream FFT-based DSP strategy in coherent optical interconnects. Compared with conventional simplification approaches18,37, our scheme avoids compromising receiver sensitivity while offering enhanced flexibility. Furthermore, it maintains backward compatibility with conventional simplification approaches, enabling simultaneous deployment with scenario-specific optimizations on the structure of algorithms.
Hardware complexity and performance estimation of DSP schemes
By calculating the number of real-valued multiplications (RMPs) and real-valued additions (RADs) consumed for each symbol (in one state of polarization), we compare the complexity of the proposed FNT-based NF and AEQ algorithms with those of the FFT, FIR and 2D-FNT schemes.
The RMPs and RADs of the different schemes are shown in Fig. 7a (refer to the “Methods” section for the workflow of the detailed complexity derivation across different schemes). Compared with the traditional single-carrier FFT scheme, the proposed FNT NF + AEQ scheme reduces the RMPs by 93.75% and the RADs by approximately 45.7%. Compared with the 2D-FNT scheme we previously proposed, the number of RMPs and RADs is further reduced by 40% and 39.5%, respectively. To demonstrate the superiority of the complexity of the proposed scheme, we analyse the overall DSP complexity levels observed under the four schemes. We combine and normalize the complexity of the multipliers and adders by comparing their numbers of half adders, as shown in Fig. 7b. Where a 16-bit Wallace multiplier contains 411 half adders and a 16-bit adder contains 32 half adders38. Our solution provides a complexity reduction of ~90% relative to the conventional coherent FFT-based DSP approach36,39, excluding the necessary FEC algorithm.
a Statistics of RMPs and RADs in the NF and AEQ blocks based on different techniques. b The normalized complexity levels of different schemes. c BERs versus ROPs for an NANF implemented via the FNT-DSP scheme and FFT-DSP scheme. RMP real-valued multiplication, RAD real-valued addition, NF Nyquist filtering, AEQ adaptive equalization, CPR carrier phase recovery, FOE frequency offset estimation, Single-C single carrier, DSCM digital subcarrier multiplexing, Single-C Coh. single carrier modulation with coherent structure, RCM Coh. residual carrier modulation with coherent structure, Single-C SDM-SHCD single carrier modulation with the spatial division multiplexing SHCD structure, BER bit error rate, ROP received optical power, FNT Fermat number transform, FFT fast Fourier transform.
Furthermore, we compare the performance achieved by FNT-DSP and FFT-DSP using data obtained from BiDi transmission in the NANF, as shown in Fig. 7c. Under 16-bit fixed-point computations, the receiver sensitivity of FNT-DSP has almost no penalty relative to that of conventional FFT-DSP.
To further validate the advantages of FNT-DSP, we implement the FNT algorithm on an FPGA platform (XCVU13P-1FLGA2577E). We compare the resource usage between 16-bit FFT and 16-bit FNT for a transform length of 32, as shown in Table 1.
Notably, the FNT eliminates the use of the valuable DSP resources in the FPGA, and the occupation of lookup table (LUT) and register resources is substantially lower than that of FFT. Although we only validate the complexity of the transform process on the FPGA platform, this comparison of resource usage still provides strong evidence for the superiority of our proposed FNT-DSP scheme in complexity reduction.
Therefore, as a proof of concept, our work validates the feasibility of the holistic co-design combining FNT-DSP, SHCD architecture, and AR-HCF. It concurrently demonstrates that this co-design achieves substantial reductions in transmission latency and power consumption, offering a promising approach for next-generation high-speed coherent optical interconnects.
Methods
Generating a direct-current frequency pilot tone
The DC component, originating from the same laser as the signal, is directly induced by the slight bias offset of the modulator. The transfer function of the IQ modulator can be expressed as follows:
where Ein and Eout are the input and output optical amplitudes respectively; \({V}_{{{{\rm{\pi }}}}}\) is the half wave voltage of the IQ modulator, VoffsetI and VoffsetQ are the offsets of the bias voltage; VbiasI and VbiasQ are the bias voltage; VsigI(t) and VsigQ(t) are the signal voltages; \({{\varphi }}_{{{\mbox{p}}}}\) is the phase shift of the quadrature signal. The signal is loaded in the form of digital subcarriers, and Eq. (5) is obtained.
where Vs(nTs) and \({\varphi }(n{T}_{{{\mbox{s}}}})\) are the modulation amplitude and the phase component of the signal, respectively. \({\omega }_{i}\) is the frequency offset between the ith subcarrier and the DC-FPT. If VBiasI and VBiasQ are set to \({-V}_{\pi }\) and the phase shift \({{\varphi }}_{{{\mbox{p}}}}\) is set to \(\frac{\pi }{2}\), then the Taylor expansion can be used to obtain an approximate expression for the output amplitude in the linear modulation region.
where \({{\varphi }}_{{{\mbox{offset}}}}\) is the phase of the DC component. The generated DC component is proportional to the square of the bias offsets VoffsetI and VoffsetQ. This component can be considered DC-FPT. Equation (6) shows that the signal and DC components are separated from each other. Therefore, the LO component is generated between the guard intervals of the DSCM signal with no additional complexity or a receiver sensitivity penalty.
The detailed parameters of FNT
The transformation and its inverse transformation with lengths of N can be defined by the following expressions:
where x(n) and X(k) are sequences in the time domain and transformation domain, respectively. F and α are the modulus and the radix of the FNT, respectively. The modulus F is a Fermat number \({F}_{q}={2}^{{2}^{q}}+1\). F and α must satisfy the following expressions33:
where pi is the prime factor of F. Therefore, we obtain a transformation with convolutional properties in a finite integer domain (Fig. 3b), which is similar to FFT (Fig. 3a) in the complex-valued domain. The radix of the FNT can also be \(\sqrt{2}\), which can be expressed as shown below:
We denote \({{\alpha }}_{\sqrt{2}}\) as \(\sqrt{2}\) because \({{{\alpha }}_{\sqrt{2}}}^{2}=2\left({{\mathrm{mod}}}{F}_{q}\right)\). Thus, \(\sqrt{2}\) can also be regarded as the effective radix for eliminating the multiplications of the FNT.
According to the above theory, we provide several typical parameters of the FNT (including the modulus \({F}_{q}\), radix α, and transformation length N of the FNT) as shown in Table 2.
The detailed complexity calculation in different schemes
The number of complex-valued multiplications (CMPs) (NCMPs-FFT) and additions (CADs) (NCADs-FFT) in the FFT are given as
where N represents the length of the transformation. One CAD is equivalent to two RADs. We employ a simplified scheme utilizing 3 RMPs and 5 RADs for one CMP34. Note that the FNT involves only real-valued operations. The numbers of RMPs (NRMPs-FNT) and RADs (NRADs-FNT) in the FNT are given as
The algorithmic structure simplification is effective for all the schemes that are based on different transformations. Therefore, we ignore the simplification of the AEQ structure and compare the hardware complexity levels using a 4 × 4 real-valued multiple-input and multiple-output (44RV-MIMO) structure here40. 1−ηSRRC and 1−ηAEQ are the overlap ratios in the RRC and AEQ blocks, respectively. εSRRC and εAEQ are the corresponding upsampling rates. In Tx DSP, the two subcarriers are upsampled to the sampling rate of an arbitrary waveform generator (AWG), and after performing SRRC filtering based on FNT, subcarrier multiplexing is performed. According to ref. 41, for SRRC roll off filtering with a roll off factor (ROF) of 0.05, a 64-tap FDE filter with an overlap length exceeding 20 is generally used to reduce the out-of-band power. Here we use the FNT with a transform length N of 64, modulus of F4 and a radix of \(\sqrt{2}\). In Rx, the signal is sampled by a digital sampling oscilloscope (DSO) at the same sampling rate as that of AWG, and performs subcarrier demultiplexing. Subsequently, we can obtain the signal in one subcarrier through FNT-based SRRC matched filtering, while downsampling to 1-sps. Therefore, εSRRC, εAEQ, ηSRRC, and ηAEQ are set to 2, 1, 0.5, and 0.5, respectively. The SRRC length NSRRC is set to 32, whereas the AEQ length NAEQ is set to 16. The transform lengths of SRRC and AEQ algorithms (MFNT-SRRC, MFFT-SRRC, MFNT-AEQ, MFFT-AEQ) are represented as
For the SRRC filtering and matched filtering blocks, taps are constant real-valued number, while the signal is divided into real and imaginary parts as a complex-valued sequence. Consequently, FNT-based SRRC filtering involves eight FNT operations (four at the transmitter and four at the receiver) and multiplication operations between the signal and tap sequences within the Fermat number domain (FND). The numbers of RMPs (NRMPs-SRRC-FNT) and RADs (NRADs-SRRC-FNT) in the FNT-based SRRC algorithm are expressed by Eq. (14).
where \({N}_{{{\mbox{RADs}}}-{{\mbox{FNT}}}-\sqrt{2}}\) represents the number of additional RADs when we use \({{\alpha }}_{\sqrt{2}}\) as the radix of FNT.
Similarly, the FFT-based SRRC employs 8 FFT processes. Considering that performing FFT on real-valued sequences reduces computational complexity, the RMPs (NRMPs-SRRC-FFT) and RADs (NRADs-SRRC-FFT) in FFT-based SRRC are expressed as
where \(5/3\times {N}_{{{\mbox{RMPs}}}-{{\mbox{SRRC}}}-{{\mbox{FFT}}}}\) represents the number of RADs introduced by the RMPs in the FFT process.
The RMPs (NRMPs-SRRC-FIR) and RADs (NRADs-SRRC-FIR) in FIR-based SRRC are expressed as
In the FNT-based AEQ block, 16 real-valued adaptive filters are split into 32 real-valued filters with a bit-width of 8 for convolution. Therefore, the FNT-based AEQ block involves 44 FNT processes. The numbers of RMPs (NRMPs-AEQ-FNT) and RADs (NRADs-AEQ-FNT) in the FNT-based AEQ algorithm are expressed as follows:
The FFT-based AEQ block involves 24 FFT processes. The numbers of RMPs (NRMPs-AEQ-FFT) and RADs (NRADs-AEQ-FFT) in the FFT-based AEQ algorithm are expressed as follows:
The numbers of RMPs (NRMPs-AEQ-FIR) and RADs (NRADs-AEQ-FIR) in the FIR-based AEQ algorithm are expressed as follows:
Since the output of the AEQ block contains signals of two polarization states, the computational complexity of all aforementioned AEQ implementations should be halved to enable fair comparison. In single-carrier implementations, the AEQ transform lengths (\({M}_{{{\mbox{FFT}}}-{{\mbox{AEQ}}}}\) and \({M}_{{{\mbox{FNT}}}-{{\mbox{AEQ}}}}\)) are baud rate-dependent as shown in Eq. (3). In this work, the AEQ transform length of the single-carrier scheme is set to be 128.
The carrier phase recovery (CPR) algorithms we use in the conventional coherent structure are the Viterbi-and-Viterbi (V&V)39 and residual carrier modulation (RCM) schemes29,42. Frequency offset compensation (FOC) is implemented by utilizing FFT to the fourth-power of the signal sequence (FFT-based FOC). The length of the RCM block (NRCM) and the length of the FOC block (NFOC) are set to be 256. The parameter L, which represents the length of the low-pass filter, is configured as 512 according to ref. 43.
Consequently, the numbers of RMPs (NRMPs-CPR-V&V) and RADs (NRADs-CPR-V&V) in the V&V-CPR algorithm are expressed as follows:
The numbers of RMPs (NRMPs-CPR-RCM) and RADs (NRADs-CPR-RCM) in the RCM-CPR algorithm are expressed as follows:
The numbers of RMPs (NRMPs-FFT-based-FOC) and RADs (NRADs-FFT-based-FOC) in the FFT-based FOC algorithm are expressed as
The numbers of RMPs (NRMPs-FOC-RCM) and RADs (NRADs-FOC-RCM) in the RCM-FOC algorithm are expressed as follows:
From the analysis presented above, we obtain the overall DSP complexity of different schemes, as shown in Fig. 7a and b.
The detailed experimental setup
An external cavity laser (ECL, ID Photonics, CoBrite Dx4) is used as transmitters 1 and 2. A signal is generated via the Tx DSP algorithms illustrated in Fig. 1e and f. After generating the baseband signal, an AWG (Keysight M8196A), operating at 90 GSa/s, transforms the signal into four analogue tributaries. They are then sent to the coherent driver modulator (CDM, NeoPhotonics Class 60) for modulation and generating the DC-FPT at transmitter 1. Owing to the limitations of the experimental conditions, we use a dual polarization IQ modulator (DP-IQM, Fujitsu FTM7995HN) at transmitter 2. The 56 GBaud electrical DP-16QAM and 38 GBaud electrical DP-16QAM with symbol lengths of 65,536 are assigned to transmitters 1 and 2, respectively. The DSCM signal modulated with the DC-FPT is amplified by an erbium-doped fibre amplifier (EDFA). A bandpass filter (BPF) with a bandwidth of 100 GHz is used for filtering to eliminate the band noise introduced by the EDFA. The VOA at the transmitter is used to control the launch power of the fibre. Two circulators are deployed at both ends of the fibre to ensure that when transmitting the signals of transmitters 1 and 2 via ports 1 and 2 of the circulators, the signals of transmitters 2 and 1 can be received simultaneously via port 3.
At the receiver, a VOA is used to adjust the received optical power (ROP) of the signal. To reduce the signal power penalty, we use a 90:10 coupler to separate a small amount of the signal to regenerate the LO. A pure DC-FPT can be obtained through filtering via an FBG. The centre wavelength of the utilized FBG is 1550.01 nm, which matches that of the DC-FPT, with a 3 dB bandwidth of ~3.5 GHz. After filtering, the obtained DC-FPT is used as the optical seed source for the OIL, and the polarization of the optical seed source is controlled by a PC. A VODL is used to align the transmission delay between the signal and the regenerated LO, avoiding the additional phase noise caused by the relative time delay (RTD). The DSCM signal and regenerated LO are sent directly to the integrated coherent receiver (ICR, NeoPhotonics Class 40). Following coherent detection, the XI, XQ, YI and YQ components are sampled and received by a DSO (LeCory LabMaster 10-36Zi-A) at a sampling rate of 80 GHz for subsequent offline processing. The optical spectrum in the experiment is acquired through an optical spectrum analyser (OSA, APEX technologies, AP2081A) with a resolution of 0.04 pm. All the experimental results are obtained via the proposed FNT-DSP.
BER characterization
Each BER value is calculated via a of 216 (65,536)-length pseudo-random binary sequence transmitted on subcarrier −1. We determine the performance of the two subcarriers in the experiment, as shown in Supplementary Fig. 4. The performance of the two subcarriers is consistent. Therefore, in the presentation of the above experimental data, we take the BER value of subcarrier −1 to verify the system performance. Notably, to evaluate the transmission performance of two polarization states (X and Y polarizations), the BER of each subcarrier is defined as the arithmetic mean of the BER values from both polarization states.
Locking range of optical injection locking
The regeneration of the LO is the key to the above-described structure. The free-running wavelength of the OIL-DFB laser is adjusted closely to the DC-FPT by utilizing a thermoelectric cooler (TEC) module to realize the regeneration of LO, as shown in Fig. 8a. It is necessary to test the frequency locking range of the OIL-DFB laser used in our experiment. The locking range of the OIL is related to the injection power, satisfying the following expression44,45:
where fd is the cavity mode spacing of the OIL-DFB laser; Pin is the injected power; and Pout is the output power of the OIL-DFB laser. α is the linewidth enhancement factor. \(\triangle {\omega }\) represents the drift between the free-running central frequency of the OIL-DFB laser and the frequency of the desired regenerated LO. To distinguish the possibly unstable OIL region, we conduct 20 individual measurements for each set of different frequency offsets and injection powers, and the results are shown in Fig. 8b. With −35 dBm injection power, the tolerant FO range of OIL is −200 to 0 MHz. The asymmetric injection power tolerance at positive and negative frequency offsets is caused by the negative frequency detuning induced by the non-zero linewidth enhancement factor.
Data availability
The data that support the findings of this study are available from the corresponding authors upon reasonable request.
Code availability
The codes that support the findings of this study are available from the corresponding authors upon reasonable request.
References
Zhang, C. et al. Clone-comb-enabled high-capacity digital-analogue fronthaul with high-order modulation formats. Nat. Photon. 17, 1000–1008 (2023).
Zhou, Z. et al. Communications with guaranteed bandwidth and low latency using frequency-referenced multiplexing. Nat. Electron. 6, 694–702 (2023).
Xie, C. & Cheng, J. Coherent optics for data center networks. In 2020 IEEE Photonics Society Summer Topicals Meeting Series (SUM) 1–2 (IEEE, 2020).
Pan, J. & McElhannon, J. Future edge cloud and edge computing for Internet of Things applications. IEEE Internet Things J. 5, 439–449 (2018).
Cao, J. et al. Real-Time 1.6 T (2 × 800 G) optical interconnection with coherent BiDi and SiP-based polarization tracker for dual LOs with 1 THz spacing. In 2023 Asia Communications and Photonics Conference/2023 International Photonics and Optoelectronics Meetings (ACP/POEM), 1–4 (IEEE, 2023).
Gui, T., Wang, X. & Tang, M. Real-time demonstration of homodyne coherent bidirectional transmission for next-generation data center interconnects. J. Lightwave Technol. 39, 1231–1238 (2021).
Zhou, X., Urata, R. & Liu, H. Beyond 1 Tb/s intra-data center interconnect technology: IM-DD OR coherent?. J. Lightwave Technol. 38, 475–484 (2020).
Jia, Z. & Campos, L. A. Coherent optics ready for prime time in short-haul networks. IEEE Netw. 35, 8–14 (2021).
Faruk, M. S. et al. Coherent passive optical networks: why, when, and how. IEEE Commun. Mag. 59, 112–117 (2021).
Nagarajan, R., Lyubomirsky, I. & Agazzi, O. Low power DSP-based transceivers for data center optical fiber communications (Invited Tutorial). J. Lightwave Technol. 39, 5221–5231 (2021).
Kupfer, T., Bisplinghof, A., Duthel, T., Fludger, C. & Langenbach, S. Optimizing power consumption of a coherent DSP for metro and data center interconnects. In 2017 Optical Fiber Communication Conference (OFC) Th3G.2 (IEEE, 2017).
Mi, G., He, X. & Zheng, X. Relevance of Latency in Ethernet Networking for AI Infrastructure. In 2024 European Conference on Optical Communication (ECOC) 1299–1302 (IEEE, 2024).
Wang, L. et al. Bi-directional self-homodyne transmission with MIMO-free DSP for next-generation data center interconnects. J. Lightwave Technol. 40, 6179–6189 (2022).
Sowailem, M. Y. S. et al. Self-homodyne system for next generation intra-datacenter optical interconnects. Opt. Express 25, 27834 (2017).
Zhang, R., Kuzmin, K., Chen, Y.-W. & Way, W. I. 800G/λ Self-homodyne coherent links with simplified DSP for next-generation intra-data centers. J. Lightwave Technol. 41, 1216–1222 (2023).
Zhang, C. et al. High-fidelity sub-petabit-per-second self-homodyne fronthaul using broadband electro-optic combs. Nat. Commun. 15, 6621 (2024).
Cheng, J., Xie, C., Tang, M. & Fu, S. A low-complexity adaptive equalizer for digital coherent short-reach optical transmission systems. In 2019 Optical Fiber Communication Conference (OFC) M3H.2 (IEEE, 2019).
Yang, Y. et al. Cost-effective and robust DSP scheme for a short-reach coherent system in the presence of transmitter IQ skew and chromatic dispersion. Opt. Lett. 46, 4606–4609 (2021).
Liu, Z., Chen, S., Chen, J., Li, Y. & Tang, M. Low complexity equalization in coherent optical communication based on Fermat number transform. Opt. Lett. 49, 3910–3913 (2024).
Liu, J. et al. Low hardware-complexity IQ skew tolerant finite field and time domain hybrid adaptive equalization for 400G/800G-ZR coherent transmission. Opt. Express 32, 43020–43032 (2024).
Xing, Y. et al. Low-complexity chromatic dispersion compensation using high-radix Fermat number transform. J. Lightwave Technol. 42, 5190–5203 (2023).
Hong, Y. et al. Hollow-core NANF for high-speed short-reach transmission in the S+C+L-bands. J. Lightwave Technol. 39, 6167–6174 (2021).
Meseguer, A. C., De Araujo, J. T. & Antona, J.-C. Multi-core vs. hollow-core fibers: technical study of their viability in SDM power-constrained submarine systems. J. Lightwave Technol. 41, 4002–4009 (2023).
Russell, P. Invited commentary: hollow core glass fibre—the perfect pipe for light. eLight 5, 4 (2025).
Yang, H. et al. 800G Low-latency photonic data-center interconnections over 5 km hollow-core fiber. IEEE Commun. Mag. 63, 1–7 (2024).
Ge, D. et al. Estimation of Kerr nonlinearity in an anti-resonant hollow-core fiber by high-order QAM transmission. In 2023 Optical Fiber Communications Conference and Exhibition (OFC), 1–3 (IEEE, 2023).
Iyer, A., Xu, W., Antonio-Lopez, J. E., Correa, R. A. & Renninger, W. H. Ultra-low Brillouin scattering in anti-resonant hollow-core fibers. APL Photonics 5, 096109 (2020).
Michaud-Belleau, V. et al. Backscattering in antiresonant hollow-core fibers: over 40 dB lower than in standard optical fibers. Optica 8, 216–219 (2021).
Fang, X. et al. Overcoming laser phase noise for low-cost coherent optical communication. Nat. Commun. 15, 6339 (2024).
Du, H. et al. Self-homodyne coherent detection transmission through a (1+8) multicore fiber with one polarization-maintaining core. J. Lightwave Technol. 42, 1460–1468 (2024).
Bradley, T. D. et al. Antiresonant hollow core fibre with 0.65 dB/km attenuation across the C and L telecommunication bands. in 2019 European Conference on Optical Communication (ECOC) 1–4 (IEEE, 2019).
Poletti, F. et al. Towards high-capacity fibre-optic communications at the speed of light in vacuum. Nat. Photon. 7, 279–284 (2013).
Agarwal, R. & Burrus, C. Fast convolution using Fermat number transforms with applications to digital filtering. IEEE Trans. Acoust. Speech Signal Process. 22, 87–97 (1974).
Leibrich, J. & Rosenkranz, W. Frequency domain equalization with minimum complexity in coherent optical transmission systems. In 2010 Optical Fiber Communication Conference (OFC) OWV1 (IEEE, 2010).
Wang, W. et al. Real-time FPGA investigation of potential FEC schemes for 800G-ZR/ZR+ forward error correction. J. Lightwave Technol. 41, 926–933 (2023).
Kudo, R. et al. Coherent optical single carrier transmission using overlap frequency domain equalization for long-haul optical systems. J. Lightwave Technol. 27, 3721–3728 (2009).
Cheng, J., Xie, C., Tang, M. & Fu, S. Hardware efficient adaptive equalizer for coherent short-reach optical interconnects. IEEE Photon. Technol. Lett. 31, 1249–1252 (2019).
Waters, R. S. & Swartzlander, E. E. A reduced complexity wallace multiplier reduction. IEEE Trans. Comput. 59, 1134–1137 (2010).
Kuschnerov, M. et al. DSP for coherent single-carrier receivers. J. Lightwave Technol. 27, 3614–3622 (2009).
Paskov, M., Lavery, D. & Savory, S. J. Blind equalization of receiver in-phase/quadrature skew in the presence of Nyquist filtering. IEEE Photon. Technol. Lett. 25, 2446–2449 (2013).
Wang, J., Xie, C. & Pan, Z. Generation of spectrally efficient Nyquist-WDM QPSK signals using digital FIR or FDE filters at transmitters. J. Lightwave Technol. 30, 3679–3686 (2012).
Chen, Y. et al. Low-cost 100 G PON based on a residual optical carrier for carrier recovery. Opt. Lett. 50, 181 (2025).
Fan, L. et al. Hardware-efficient, ultra-fast and joint polarization and carrier phase tracking scheme based on frequency domain pilot tones for DSCM systems. J. Lightwave Technol. 41, 1454–1463 (2023).
Liu, Z. & Slavík, R. Optical injection locking: from principle to applications. J. Lightwave Technol. 38, 43–59 (2020).
Liu, Z., Kim, J.-Y., Wu, D. S., Richardson, D. J. & Slavík, R. Homodyne OFDM with optical injection locking for carrier recovery. J. Lightwave Technol. 33, 34–41 (2015).
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos. 62225110, 62205114 and 61931010), the Major Programme (JD) of Hubei Province (No. 2023BAA001-1) and the Hubei Optical Fundamental Research Centre.
Author information
Authors and Affiliations
Contributions
M.T. conceived the concept of this work. S.C. and Z.L. implemented the experiments and algorithms. M.T., M.Z., C.Z., and Z.H. provided guidance and assisted with the experiments. S.C. and Z.L. analysed the results of the numerical simulations and experiments. Z.Z., P.L., L.Z. and J.L. provided the NANF. X.H., P.L., L.Z. and J.L. realized the splicing between NANF and SSMF. C.C. conducted performance tests on NANF. X.L. helped analyse the complexity of DSP. All the authors participated in the writing of the manuscript. M.T. supervised this project.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Engineering thanks Jianyang Shi and the other, anonymous, reviewers for their contribution to the peer review of this work. Primary Handling Editors: [Chaoran Huang] and [Wenjie Wang, Rosamund Daw].
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, S., Liu, Z., Zhao, C. et al. Coherent optical interconnects using Fermat number transform and hollow core fibre. Commun Eng 4, 169 (2025). https://doi.org/10.1038/s44172-025-00505-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s44172-025-00505-3