# Evaluation Report: HK 4-array v3.2 vs CAE v1.2 Microphones

Report date: 2026-06-09

## 1. Objective

This report compares two robot microphone options: HK mic 4 array linear v3.2 and CAE v1.2. Each dataset contains the real test microphone and an `obs` laptop-microphone recording captured as a simultaneous reference.

The test conditions cover 1 m, 2 m, and 3 m speech distance. For each distance, recordings were made without robot-speaker playback and with robot-speaker playback to evaluate echo robustness.

## 2. Input Data

- HK real microphone: `/home/thachhs/end_baocaomic/michk_32/hk/<1m|2m|3m>/*.wav`.
- HK laptop reference: `/home/thachhs/end_baocaomic/michk_32/obs/<1m|2m|3m>/*.wav`.
- CAE real microphone: `/home/thachhs/end_baocaomic/micanhthao/micmoi/<1m|2m|3m>/*.wav`.
- CAE laptop reference: `/home/thachhs/end_baocaomic/micanhthao/obs/<1m|2m|3m>/*.wav`.

Files ending in `_echo` are treated as robot-speaker playback / echo cases. For example, `1m/1_echo.wav` from the real microphone is paired with `1m/1_echo.wav` under the matching `obs` folder.

## 3. Technical Metrics

- LUFS: integrated loudness, used to compare perceived recording level.
- RMS/Peak dBFS: average and peak level relative to full scale.
- Estimated SNR: active RMS minus noise floor; the noise floor is estimated from the lowest-energy 20% of frames.
- Absolute active %: percentage of frames above -50 dBFS; useful for detecting suppressed or missing audio.
- Leading silence ms: initial near-silent duration below -55 dBFS.
- Longest silence ms: longest near-silent interval below -55 dBFS.
- Dropout score: 0-100 score combining silence ratio, long gaps, start loss, and near-muted files.
- Echo risk flags: echo files are flagged if loudness drops by at least 5 LU against the same-distance no-echo baseline, active ratio is below 45%, speech-band energy is below 30%, SNR is below 28 dB, envelope correlation with OBS is below 0.30, duration is abnormal, or an energy dropout is detected.
- Speech %: energy share in the 300-3400 Hz speech band.
- Echo index: envelope autocorrelation heuristic in the 50-500 ms lag region; used only for relative comparison because no separate far-end signal is available for true ERLE.
- LSD dB: log-spectral distance between the real microphone and laptop reference within the same paired recording.

## 4. Executive Summary

- Without robot speaker playback, both microphones captured usable speech: average SNR is 54.8 dB for HK and 39.1 dB for CAE.
- With robot-speaker echo, HK is more stable on the key indicators: HK echo SNR is 42.0 dB versus 31.5 dB for CAE; HK echo active ratio is 71.8% versus 65.6% for CAE; HK/OBS envelope correlation is 0.506 versus 0.421 for CAE/OBS.
- CAE shows stronger suppression / information-loss risk in echo: CAE echo LUFS drops by -4.4 LU versus its no-echo baseline at the same distance, while HK changes by +4.0 LU.
- Echo files with at least two risk flags: HK 0/15, CAE 8/21. Low-loudness echo files: HK 0, CAE 7; low-active files: HK 0, CAE 4; abnormally low speech-band files: HK 0, CAE 1.
- Average echo loudness is -23.6 LUFS for HK and -24.7 LUFS for CAE. No meaningful clipping was detected on either microphone.
- Deployment favors HK: it is compact and includes tuning software; CAE requires an external sound card and a more complex setup flow.

## 5. Overall HK vs CAE Summary

| Mic             |   N normal |   N echo | Normal SNR dB   | Echo SNR dB   | Normal LUFS   | Echo LUFS   | Echo dropout score   |   Echo start-loss files |   Echo long-dropout files |   Echo muted/low files |   Clip max % | Format              | Deployment note                                              |
|:----------------|-----------:|---------:|:----------------|:--------------|:--------------|:------------|:---------------------|------------------------:|--------------------------:|-----------------------:|-------------:|:--------------------|:-------------------------------------------------------------|
| HK 4-array v3.2 |         15 |       15 | 54.8 ± 7.1      | 42.0 ± 11.0   | -27.6 ± 1.2   | -23.6 ± 2.1 | 11.6 ± 6.6           |                       2 |                         3 |                      0 |            0 | 16000 Hz, 2 ch (30) | Compact, with built-in tuning/configuration software.        |
| CAE v1.2        |         30 |       21 | 39.1 ± 1.3      | 31.5 ± 3.8    | -20.3 ± 0.8   | -24.7 ± 3.0 | 9.1 ± 5.6            |                       1 |                         1 |                      0 |            0 | 16000 Hz, 1 ch (51) | More complex installation, requiring an external sound card. |

## 6. Real-Microphone Comparison by Distance and Mode

| Mode   | Distance   | Mic             |   N | LUFS        | RMS dBFS    | SNR dB      | Abs. active %   | Leading silence ms   | Longest silence ms   | Dropout score   |   Muted/low files | Speech %    | Echo index    | Clip %        |
|:-------|:-----------|:----------------|----:|:------------|:------------|:------------|:----------------|:---------------------|:---------------------|:----------------|------------------:|:------------|:--------------|:--------------|
| Normal | 1m         | HK 4-array v3.2 |   5 | -27.0 ± 0.6 | -27.8 ± 1.4 | 52.1 ± 10.5 | 65.9 ± 12.9     | 512 ± 590            | 1006 ± 366           | 21.2 ± 10.2     |                 0 | 67.9 ± 10.9 | 0.868 ± 0.054 | 0.000 ± 0.000 |
| Normal | 1m         | CAE v1.2        |  10 | -20.1 ± 0.4 | -21.3 ± 0.8 | 39.3 ± 1.4  | 60.4 ± 11.3     | 326 ± 270            | 716 ± 294            | 20.1 ± 6.6      |                 0 | 52.5 ± 8.2  | 0.898 ± 0.028 | 0.000 ± 0.000 |
| Normal | 2m         | HK 4-array v3.2 |   5 | -28.2 ± 1.8 | -28.8 ± 1.9 | 54.7 ± 6.5  | 65.0 ± 8.5      | 104 ± 108            | 1024 ± 81            | 20.6 ± 5.0      |                 0 | 80.5 ± 6.2  | 0.853 ± 0.028 | 0.000 ± 0.000 |
| Normal | 2m         | CAE v1.2        |  10 | -20.4 ± 1.1 | -21.6 ± 1.2 | 38.8 ± 1.7  | 63.5 ± 9.6      | 230 ± 184            | 698 ± 346            | 17.8 ± 5.9      |                 0 | 60.0 ± 9.1  | 0.846 ± 0.059 | 0.000 ± 0.000 |
| Normal | 3m         | HK 4-array v3.2 |   5 | -27.6 ± 0.8 | -28.2 ± 0.9 | 57.5 ± 1.9  | 69.4 ± 3.9      | 256 ± 266            | 886 ± 230            | 18.1 ± 2.1      |                 0 | 91.2 ± 3.1  | 0.864 ± 0.039 | 0.000 ± 0.000 |
| Normal | 3m         | CAE v1.2        |  10 | -20.3 ± 1.0 | -21.9 ± 1.2 | 39.2 ± 0.9  | 58.4 ± 5.7      | 237 ± 234            | 962 ± 428            | 20.5 ± 5.4      |                 0 | 51.0 ± 5.3  | 0.897 ± 0.022 | 0.000 ± 0.000 |
| Echo   | 1m         | HK 4-array v3.2 |   5 | -21.5 ± 1.2 | -23.0 ± 1.2 | 41.1 ± 10.9 | 73.1 ± 7.6      | 32 ± 72              | 296 ± 214            | 9.9 ± 4.8       |                 0 | 84.5 ± 3.7  | 0.632 ± 0.103 | 0.000 ± 0.000 |
| Echo   | 1m         | CAE v1.2        |  11 | -25.0 ± 3.6 | -27.0 ± 3.7 | 31.9 ± 4.9  | 60.1 ± 22.3     | 54 ± 94              | 399 ± 353            | 11.0 ± 6.6      |                 0 | 85.5 ± 26.4 | 0.766 ± 0.140 | 0.000 ± 0.000 |
| Echo   | 2m         | HK 4-array v3.2 |   5 | -24.4 ± 2.2 | -25.1 ± 1.8 | 37.7 ± 12.2 | 76.1 ± 9.7      | 112 ± 155            | 396 ± 470            | 9.8 ± 7.7       |                 0 | 88.2 ± 6.3  | 0.619 ± 0.161 | 0.000 ± 0.000 |
| Echo   | 2m         | CAE v1.2        |   5 | -24.9 ± 2.6 | -26.3 ± 3.0 | 29.3 ± 1.9  | 78.0 ± 11.9     | 0 ± 0                | 152 ± 153            | 4.4 ± 3.0       |                 0 | 89.8 ± 3.9  | 0.768 ± 0.064 | 0.000 ± 0.000 |
| Echo   | 3m         | HK 4-array v3.2 |   5 | -24.9 ± 1.1 | -26.5 ± 1.2 | 47.3 ± 10.0 | 66.3 ± 6.5      | 228 ± 482            | 576 ± 419            | 15.2 ± 6.9      |                 0 | 91.2 ± 3.1  | 0.721 ± 0.125 | 0.000 ± 0.000 |
| Echo   | 3m         | CAE v1.2        |   5 | -23.8 ± 1.9 | -26.6 ± 2.0 | 32.7 ± 1.2  | 65.3 ± 6.6      | 64 ± 87              | 284 ± 31             | 9.4 ± 2.0       |                 0 | 97.1 ± 1.0  | 0.772 ± 0.040 | 0.000 ± 0.000 |

## 7. Real Microphone vs Laptop/OBS Paired Comparison

Positive `Mic-OBS` values mean the real microphone is higher than the laptop microphone. Higher `Envelope corr` means the two sources capture the same acoustic events more similarly after time alignment.

| Mode   | Distance   | Mic             |   N pairs | Mic-OBS RMS dB   | Mic-OBS SNR dB   | Active delta pp   | Dropout delta   | Envelope corr   | LSD dB     |
|:-------|:-----------|:----------------|----------:|:-----------------|:-----------------|:------------------|:----------------|:----------------|:-----------|
| Normal | 1m         | CAE v1.2        |        10 | 6.4 ± 0.8        | 27.1 ± 1.3       | -39.6 ± 11.3      | 20.1 ± 6.6      | 0.566 ± 0.132   | 6.7 ± 0.7  |
| Normal | 1m         | HK 4-array v3.2 |         5 | -2.9 ± 0.7       | 40.6 ± 10.5      | -34.1 ± 12.9      | 21.2 ± 10.2     | 0.670 ± 0.221   | 8.5 ± 0.5  |
| Normal | 2m         | CAE v1.2        |        10 | 5.6 ± 2.6        | 26.2 ± 2.3       | -36.5 ± 9.6       | 17.8 ± 5.9      | 0.598 ± 0.161   | 6.7 ± 0.4  |
| Normal | 2m         | HK 4-array v3.2 |         5 | -4.2 ± 1.3       | 43.0 ± 6.4       | -35.0 ± 8.5       | 20.6 ± 5.0      | 0.769 ± 0.052   | 10.3 ± 0.9 |
| Normal | 3m         | CAE v1.2        |        10 | 6.2 ± 1.0        | 27.7 ± 1.4       | -41.6 ± 5.7       | 20.5 ± 5.4      | 0.652 ± 0.130   | 6.3 ± 0.6  |
| Normal | 3m         | HK 4-array v3.2 |         5 | -4.0 ± 0.9       | 45.9 ± 1.9       | -30.6 ± 3.9       | 18.1 ± 2.1      | 0.775 ± 0.026   | 9.2 ± 0.7  |
| Echo   | 1m         | CAE v1.2        |        11 | -17.5 ± 3.9      | 11.9 ± 6.8       | -39.9 ± 22.3      | 11.0 ± 6.6      | 0.422 ± 0.193   | 8.9 ± 1.9  |
| Echo   | 1m         | HK 4-array v3.2 |         5 | -15.6 ± 1.3      | 22.0 ± 9.4       | -26.9 ± 7.6       | 9.9 ± 4.8       | 0.479 ± 0.092   | 11.1 ± 0.9 |
| Echo   | 2m         | CAE v1.2        |         5 | -17.3 ± 3.0      | 13.3 ± 2.8       | -22.0 ± 11.9      | 4.4 ± 3.0       | 0.348 ± 0.082   | 7.7 ± 1.3  |
| Echo   | 2m         | HK 4-array v3.2 |         5 | -18.0 ± 2.0      | 17.5 ± 9.2       | -23.9 ± 9.7       | 9.8 ± 7.7       | 0.516 ± 0.182   | 8.3 ± 1.4  |
| Echo   | 3m         | CAE v1.2        |         5 | -17.4 ± 2.3      | 15.7 ± 3.1       | -34.7 ± 6.6       | 9.4 ± 2.0       | 0.492 ± 0.177   | 10.3 ± 2.5 |
| Echo   | 3m         | HK 4-array v3.2 |         5 | -19.1 ± 1.4      | 27.1 ± 10.0      | -33.7 ± 6.5       | 15.2 ± 6.9      | 0.524 ± 0.206   | 8.2 ± 0.8  |

## 8. Echo Impact vs No-Echo Condition

Delta columns are computed as the echo-condition mean minus the no-echo-condition mean for the same microphone and distance.

| Mic             | Distance   |   N normal |   N echo |   Delta LUFS |   Delta RMS dB |   Delta SNR dB |   Delta active pp |   Delta dropout |   Delta leading ms |   Delta echo index |
|:----------------|:-----------|-----------:|---------:|-------------:|---------------:|---------------:|------------------:|----------------:|-------------------:|-------------------:|
| HK 4-array v3.2 | 1m         |          5 |        5 |        5.417 |          4.799 |        -11.067 |             7.263 |         -11.326 |           -480     |             -0.236 |
| HK 4-array v3.2 | 2m         |          5 |        5 |        3.866 |          3.721 |        -17.009 |            11.113 |         -10.829 |              8     |             -0.234 |
| HK 4-array v3.2 | 3m         |          5 |        5 |        2.699 |          1.708 |        -10.201 |            -3.084 |          -2.943 |            -28     |             -0.143 |
| CAE v1.2        | 1m         |         10 |       11 |       -4.846 |         -5.71  |         -7.414 |            -0.24  |          -9.086 |           -272.364 |             -0.132 |
| CAE v1.2        | 2m         |         10 |        5 |       -4.472 |         -4.734 |         -9.558 |            14.549 |         -13.427 |           -230     |             -0.078 |
| CAE v1.2        | 3m         |         10 |        5 |       -3.516 |         -4.707 |         -6.548 |             6.961 |         -11.053 |           -173     |             -0.125 |

## 9. Echo Information-Loss Risk

This section focuses on indicators that are closer to the listening observation: suppression, start loss, or strong reduction of speech content while the robot speaker is playing.

| Mic             |   N echo |   Avg echo SNR dB |   Avg echo active % |   Avg echo envelope corr |   Avg LUFS drop vs normal |   Low loudness files |   Low active files |   Low speech-band files |   Low SNR files |   Low corr files |   Long duration files |   Risk files >=2 flags |
|:----------------|---------:|------------------:|--------------------:|-------------------------:|--------------------------:|---------------------:|-------------------:|------------------------:|----------------:|-----------------:|----------------------:|-----------------------:|
| HK 4-array v3.2 |       15 |              42   |                71.8 |                    0.506 |                       4   |                    0 |                  0 |                       0 |               1 |                0 |                     0 |                      0 |
| CAE v1.2        |       21 |              31.5 |                65.6 |                    0.421 |                      -4.4 |                    7 |                  4 |                       1 |               3 |                6 |                     1 |                      8 |

### 9.1 Echo Files with High Risk Flags

| Mic                        | Distance   |   Trial |   s |   LUFS |   LUFS drop |   SNR |   Active % |   Speech % |   Env corr |   Leading ms |   Dropout |   Risk flags | Audio path                       |
|:---------------------------|:-----------|--------:|----:|-------:|------------:|------:|-----------:|-----------:|-----------:|-------------:|----------:|-------------:|:---------------------------------|
| CAE v1.2                   | 1m         |      10 |   3 | -31.67 |      -11.56 | 36.06 |      39.13 |       6.5  |       0.49 |            0 |     20.25 |            4 | micanhthao/micmoi/1m/10_echo.wav |
| CAE v1.2                   | 1m         |      11 |  32 | -24.11 |       -4    | 35.25 |      38.04 |      92.24 |       0.16 |           90 |     14.94 |            4 | micanhthao/micmoi/1m/11_echo.wav |
| CAE v1.2                   | 1m         |       1 |   5 | -29.16 |       -9.05 | 29.02 |      37.07 |      91.22 |       0.23 |            0 |     15.99 |            3 | micanhthao/micmoi/1m/1_echo.wav  |
| CAE v1.2                   | 2m         |       4 |   5 | -26.53 |       -6.08 | 27.79 |      80.96 |      91.91 |       0.34 |            0 |      3.09 |            2 | micanhthao/micmoi/2m/4_echo.wav  |
| CAE v1.2                   | 1m         |       6 |   4 | -29.99 |       -9.87 | 28.61 |      52.63 |      98.36 |       0.55 |          300 |     14.4  |            2 | micanhthao/micmoi/1m/6_echo.wav  |
| CAE v1.2                   | 2m         |       1 |   5 | -28.55 |       -8.1  | 29.13 |      58.52 |      86.47 |       0.28 |            0 |      7.58 |            2 | micanhthao/micmoi/2m/1_echo.wav  |
| CAE v1.2                   | 3m         |       4 |   2 | -25.9  |       -5.58 | 31.67 |      62.81 |      98.07 |       0.28 |           10 |     12.41 |            2 | micanhthao/micmoi/3m/4_echo.wav  |
| CAE v1.2                   | 1m         |       3 |   4 | -23.87 |       -3.76 | 20.91 |      99.25 |      90.31 |       0.23 |            0 |      0    |            2 | micanhthao/micmoi/1m/3_echo.wav  |
| CAE v1.2                   | 3m         |       1 |   6 | -25.71 |       -5.39 | 31.37 |      60.43 |      95.81 |       0.45 |          190 |      8.06 |            1 | micanhthao/micmoi/3m/1_echo.wav  |
| CAE v1.2                   | 1m         |       8 |   3 | -20.92 |       -0.8  | 37.91 |      36.45 |      96.39 |       0.86 |           70 |     18.85 |            1 | micanhthao/micmoi/1m/8_echo.wav  |
| CAE v1.2                   | 2m         |       5 |   4 | -22.4  |       -1.95 | 27.18 |      90.73 |      95.8  |       0.4  |            0 |      0.88 |            1 | micanhthao/micmoi/2m/5_echo.wav  |
| HK mic 4 array linear v3.2 | 2m         |       3 |   3 | -25.96 |        2.27 | 26.21 |      85.62 |      79.75 |       0.58 |            0 |      3.04 |            1 | michk_32/hk/2m/3_echo.wav        |
| CAE v1.2                   | 2m         |       2 |   5 | -24.61 |       -4.16 | 31.74 |      78.36 |      87.5  |       0.26 |            0 |      7.51 |            1 | micanhthao/micmoi/2m/2_echo.wav  |
| HK mic 4 array linear v3.2 | 2m         |       1 |   4 | -20.94 |        7.29 | 35.13 |      79.95 |      94.7  |       0.51 |          310 |      8.66 |            1 | michk_32/hk/2m/1_echo.wav        |

## 10. Waveform and Energy Evidence for Echo Handling

The figures below plot waveform and RMS energy envelope for the real microphone and the simultaneous OBS/laptop recording in the same echo file. Lower real-microphone energy than OBS is not a true ERLE measurement by itself because gain and microphone placement also matter; however, it is useful visual evidence for suppression, missing segments, low active ratio, or speech preservation while the robot speaker is playing.

### 10.1 Representative Figures

**HK 4-array v3.2 | 3m | trial 4 | path `michk_32/hk/3m/4_echo.wav` | risk flags 1**

![HK 4-array v3.2 3m 4](figures/echo_wave_energy/hk_3m_4_echo_wave_energy.png)

**HK 4-array v3.2 | 2m | trial 2 | path `michk_32/hk/2m/2_echo.wav` | risk flags 1**

![HK 4-array v3.2 2m 2](figures/echo_wave_energy/hk_2m_2_echo_wave_energy.png)

**CAE v1.2 | 1m | trial 11 | path `micanhthao/micmoi/1m/11_echo.wav` | risk flags 4**

![CAE v1.2 1m 11](figures/echo_wave_energy/cae_1m_11_echo_wave_energy.png)

**CAE v1.2 | 1m | trial 10 | path `micanhthao/micmoi/1m/10_echo.wav` | risk flags 4**

![CAE v1.2 1m 10](figures/echo_wave_energy/cae_1m_10_echo_wave_energy.png)

**CAE v1.2 | 1m | trial 1 | path `micanhthao/micmoi/1m/1_echo.wav` | risk flags 3**

![CAE v1.2 1m 1](figures/echo_wave_energy/cae_1m_1_echo_wave_energy.png)

### 10.2 Full Waveform/Energy Figure Index for Echo Files

| Mic             | Distance   |   Trial | Mic audio                        | OBS audio                     |   LUFS |   LUFS drop |   SNR |   Active % |   Speech % |   Risk flags | Wave/Energy figure                                             |
|:----------------|:-----------|--------:|:---------------------------------|:------------------------------|-------:|------------:|------:|-----------:|-----------:|-------------:|:---------------------------------------------------------------|
| HK 4-array v3.2 | 1m         |       1 | michk_32/hk/1m/1_echo.wav        | michk_32/obs/1m/1_echo.wav    | -23.68 |        3.27 | 36.37 |      74.25 |      87.66 |            0 | [PNG](figures/echo_wave_energy/hk_1m_1_echo_wave_energy.png)   |
| HK 4-array v3.2 | 1m         |       2 | michk_32/hk/1m/2_echo.wav        | michk_32/obs/1m/2_echo.wav    | -20.72 |        6.24 | 59.9  |      64.88 |      86.28 |            0 | [PNG](figures/echo_wave_energy/hk_1m_2_echo_wave_energy.png)   |
| HK 4-array v3.2 | 1m         |       3 | michk_32/hk/1m/3_echo.wav        | michk_32/obs/1m/3_echo.wav    | -21.15 |        5.8  | 37.66 |      72.34 |      79.71 |            0 | [PNG](figures/echo_wave_energy/hk_1m_3_echo_wave_energy.png)   |
| HK 4-array v3.2 | 1m         |       4 | michk_32/hk/1m/4_echo.wav        | michk_32/obs/1m/4_echo.wav    | -20.86 |        6.1  | 31.65 |      85.21 |      87.49 |            0 | [PNG](figures/echo_wave_energy/hk_1m_4_echo_wave_energy.png)   |
| HK 4-array v3.2 | 1m         |       5 | michk_32/hk/1m/5_echo.wav        | michk_32/obs/1m/5_echo.wav    | -21.28 |        5.68 | 39.71 |      68.92 |      81.22 |            0 | [PNG](figures/echo_wave_energy/hk_1m_5_echo_wave_energy.png)   |
| HK 4-array v3.2 | 2m         |       1 | michk_32/hk/2m/1_echo.wav        | michk_32/obs/2m/1_echo.wav    | -20.94 |        7.29 | 35.13 |      79.95 |      94.7  |            1 | [PNG](figures/echo_wave_energy/hk_2m_1_echo_wave_energy.png)   |
| HK 4-array v3.2 | 2m         |       2 | michk_32/hk/2m/2_echo.wav        | michk_32/obs/2m/2_echo.wav    | -25.89 |        2.33 | 57.35 |      62.66 |      94.13 |            1 | [PNG](figures/echo_wave_energy/hk_2m_2_echo_wave_energy.png)   |
| HK 4-array v3.2 | 2m         |       3 | michk_32/hk/2m/3_echo.wav        | michk_32/obs/2m/3_echo.wav    | -25.96 |        2.27 | 26.21 |      85.62 |      79.75 |            1 | [PNG](figures/echo_wave_energy/hk_2m_3_echo_wave_energy.png)   |
| HK 4-array v3.2 | 2m         |       4 | michk_32/hk/2m/4_echo.wav        | michk_32/obs/2m/4_echo.wav    | -23.36 |        4.87 | 29.85 |      82.94 |      87.22 |            0 | [PNG](figures/echo_wave_energy/hk_2m_4_echo_wave_energy.png)   |
| HK 4-array v3.2 | 2m         |       5 | michk_32/hk/2m/5_echo.wav        | michk_32/obs/2m/5_echo.wav    | -25.67 |        2.56 | 39.85 |      69.28 |      85.45 |            0 | [PNG](figures/echo_wave_energy/hk_2m_5_echo_wave_energy.png)   |
| HK 4-array v3.2 | 3m         |       1 | michk_32/hk/3m/1_echo.wav        | michk_32/obs/3m/1_echo.wav    | -25.75 |        1.85 | 36.97 |      74.69 |      93.97 |            0 | [PNG](figures/echo_wave_energy/hk_3m_1_echo_wave_energy.png)   |
| HK 4-array v3.2 | 3m         |       2 | michk_32/hk/3m/2_echo.wav        | michk_32/obs/3m/2_echo.wav    | -23.16 |        4.44 | 36.26 |      71.43 |      93.63 |            0 | [PNG](figures/echo_wave_energy/hk_3m_2_echo_wave_energy.png)   |
| HK 4-array v3.2 | 3m         |       3 | michk_32/hk/3m/3_echo.wav        | michk_32/obs/3m/3_echo.wav    | -25.62 |        1.98 | 53.78 |      62.91 |      90.92 |            1 | [PNG](figures/echo_wave_energy/hk_3m_3_echo_wave_energy.png)   |
| HK 4-array v3.2 | 3m         |       4 | michk_32/hk/3m/4_echo.wav        | michk_32/obs/3m/4_echo.wav    | -25.26 |        2.33 | 57.57 |      58.92 |      91.08 |            1 | [PNG](figures/echo_wave_energy/hk_3m_4_echo_wave_energy.png)   |
| HK 4-array v3.2 | 3m         |       5 | michk_32/hk/3m/5_echo.wav        | michk_32/obs/3m/5_echo.wav    | -24.69 |        2.9  | 51.94 |      63.53 |      86.27 |            0 | [PNG](figures/echo_wave_energy/hk_3m_5_echo_wave_energy.png)   |
| CAE v1.2        | 1m         |       1 | micanhthao/micmoi/1m/1_echo.wav  | micanhthao/obs/1m/1_echo.wav  | -29.16 |       -9.05 | 29.02 |      37.07 |      91.22 |            3 | [PNG](figures/echo_wave_energy/cae_1m_1_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       2 | micanhthao/micmoi/1m/2_echo.wav  | micanhthao/obs/1m/2_echo.wav  | -22.76 |       -2.65 | 34.35 |      55.09 |      93.44 |            0 | [PNG](figures/echo_wave_energy/cae_1m_2_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       3 | micanhthao/micmoi/1m/3_echo.wav  | micanhthao/obs/1m/3_echo.wav  | -23.87 |       -3.76 | 20.91 |      99.25 |      90.31 |            2 | [PNG](figures/echo_wave_energy/cae_1m_3_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       4 | micanhthao/micmoi/1m/4_echo.wav  | micanhthao/obs/1m/4_echo.wav  | -21.73 |       -1.61 | 36.07 |      72.29 |      93.95 |            0 | [PNG](figures/echo_wave_energy/cae_1m_4_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       5 | micanhthao/micmoi/1m/5_echo.wav  | micanhthao/obs/1m/5_echo.wav  | -24.89 |       -4.77 | 32.85 |      63.16 |      96.52 |            0 | [PNG](figures/echo_wave_energy/cae_1m_5_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       6 | micanhthao/micmoi/1m/6_echo.wav  | micanhthao/obs/1m/6_echo.wav  | -29.99 |       -9.87 | 28.61 |      52.63 |      98.36 |            2 | [PNG](figures/echo_wave_energy/cae_1m_6_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       7 | micanhthao/micmoi/1m/7_echo.wav  | micanhthao/obs/1m/7_echo.wav  | -22.96 |       -2.85 | 28.61 |      86.72 |      90.73 |            0 | [PNG](figures/echo_wave_energy/cae_1m_7_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       8 | micanhthao/micmoi/1m/8_echo.wav  | micanhthao/obs/1m/8_echo.wav  | -20.92 |       -0.8  | 37.91 |      36.45 |      96.39 |            1 | [PNG](figures/echo_wave_energy/cae_1m_8_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |       9 | micanhthao/micmoi/1m/9_echo.wav  | micanhthao/obs/1m/9_echo.wav  | -22.51 |       -2.4  | 31.47 |      81.61 |      91.36 |            0 | [PNG](figures/echo_wave_energy/cae_1m_9_echo_wave_energy.png)  |
| CAE v1.2        | 1m         |      10 | micanhthao/micmoi/1m/10_echo.wav | micanhthao/obs/1m/10_echo.wav | -31.67 |      -11.56 | 36.06 |      39.13 |       6.5  |            4 | [PNG](figures/echo_wave_energy/cae_1m_10_echo_wave_energy.png) |
| CAE v1.2        | 1m         |      11 | micanhthao/micmoi/1m/11_echo.wav | micanhthao/obs/1m/11_echo.wav | -24.11 |       -4    | 35.25 |      38.04 |      92.24 |            4 | [PNG](figures/echo_wave_energy/cae_1m_11_echo_wave_energy.png) |
| CAE v1.2        | 2m         |       1 | micanhthao/micmoi/2m/1_echo.wav  | micanhthao/obs/2m/1_echo.wav  | -28.55 |       -8.1  | 29.13 |      58.52 |      86.47 |            2 | [PNG](figures/echo_wave_energy/cae_2m_1_echo_wave_energy.png)  |
| CAE v1.2        | 2m         |       2 | micanhthao/micmoi/2m/2_echo.wav  | micanhthao/obs/2m/2_echo.wav  | -24.61 |       -4.16 | 31.74 |      78.36 |      87.5  |            1 | [PNG](figures/echo_wave_energy/cae_2m_2_echo_wave_energy.png)  |
| CAE v1.2        | 2m         |       3 | micanhthao/micmoi/2m/3_echo.wav  | micanhthao/obs/2m/3_echo.wav  | -22.52 |       -2.07 | 30.48 |      81.45 |      87.52 |            0 | [PNG](figures/echo_wave_energy/cae_2m_3_echo_wave_energy.png)  |
| CAE v1.2        | 2m         |       4 | micanhthao/micmoi/2m/4_echo.wav  | micanhthao/obs/2m/4_echo.wav  | -26.53 |       -6.08 | 27.79 |      80.96 |      91.91 |            2 | [PNG](figures/echo_wave_energy/cae_2m_4_echo_wave_energy.png)  |
| CAE v1.2        | 2m         |       5 | micanhthao/micmoi/2m/5_echo.wav  | micanhthao/obs/2m/5_echo.wav  | -22.4  |       -1.95 | 27.18 |      90.73 |      95.8  |            1 | [PNG](figures/echo_wave_energy/cae_2m_5_echo_wave_energy.png)  |
| CAE v1.2        | 3m         |       1 | micanhthao/micmoi/3m/1_echo.wav  | micanhthao/obs/3m/1_echo.wav  | -25.71 |       -5.39 | 31.37 |      60.43 |      95.81 |            1 | [PNG](figures/echo_wave_energy/cae_3m_1_echo_wave_energy.png)  |
| CAE v1.2        | 3m         |       2 | micanhthao/micmoi/3m/2_echo.wav  | micanhthao/obs/3m/2_echo.wav  | -21.84 |       -1.52 | 33.93 |      73.93 |      96.39 |            0 | [PNG](figures/echo_wave_energy/cae_3m_2_echo_wave_energy.png)  |
| CAE v1.2        | 3m         |       3 | micanhthao/micmoi/3m/3_echo.wav  | micanhthao/obs/3m/3_echo.wav  | -23.22 |       -2.9  | 33.99 |      58.86 |      97.6  |            0 | [PNG](figures/echo_wave_energy/cae_3m_3_echo_wave_energy.png)  |
| CAE v1.2        | 3m         |       4 | micanhthao/micmoi/3m/4_echo.wav  | micanhthao/obs/3m/4_echo.wav  | -25.9  |       -5.58 | 31.67 |      62.81 |      98.07 |            2 | [PNG](figures/echo_wave_energy/cae_3m_4_echo_wave_energy.png)  |
| CAE v1.2        | 3m         |       5 | micanhthao/micmoi/3m/5_echo.wav  | micanhthao/obs/3m/5_echo.wav  | -22.51 |       -2.19 | 32.35 |      70.68 |      97.87 |            0 | [PNG](figures/echo_wave_energy/cae_3m_5_echo_wave_energy.png)  |

## 11. Files with the Strongest Energy Dropout / Silence Evidence

| Mic                        | Distance   | Mode   |   Trial |   s |   LUFS |   SNR |   Active % |   Leading ms |   Longest silence ms |   Dropout | Audio path                 |
|:---------------------------|:-----------|:-------|--------:|----:|-------:|------:|-----------:|-------------:|---------------------:|----------:|:---------------------------|
| HK mic 4 array linear v3.2 | 1m         | Normal |       5 |   4 | -27.77 | 59.58 |      44.61 |         1470 |                 1470 |     38.51 | michk_32/hk/1m/5.wav       |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       4 |   5 | -25.26 | 57.57 |      58.92 |         1090 |                 1090 |     24.15 | michk_32/hk/3m/4_echo.wav  |
| CAE v1.2                   | 1m         | Normal |       1 |   5 | -19.24 | 41.01 |      49.1  |          750 |                  950 |     26.36 | micanhthao/micmoi/1m/1.wav |
| CAE v1.2                   | 3m         | Normal |       8 |   6 | -20.46 | 39.2  |      62.44 |          680 |                  920 |     19.51 | micanhthao/micmoi/3m/8.wav |
| CAE v1.2                   | 1m         | Normal |       4 |   3 | -19.94 | 39.94 |      49.5  |          610 |                  710 |     28.31 | micanhthao/micmoi/1m/4.wav |
| CAE v1.2                   | 1m         | Normal |       2 |   4 | -20.59 | 39.99 |      47.12 |          600 |                  620 |     24.76 | micanhthao/micmoi/1m/2.wav |
| CAE v1.2                   | 2m         | Normal |       2 |   4 | -19.65 | 40.31 |      59.4  |          600 |                  690 |     21.25 | micanhthao/micmoi/2m/2.wav |
| HK mic 4 array linear v3.2 | 1m         | Normal |       1 |   6 | -26.17 | 60.41 |      68.78 |          540 |                 1210 |     19.33 | michk_32/hk/1m/1.wav       |
| HK mic 4 array linear v3.2 | 3m         | Normal |       4 |   5 | -27.2  | 59.51 |      63.93 |          540 |                  920 |     19.49 | michk_32/hk/3m/4.wav       |
| HK mic 4 array linear v3.2 | 1m         | Normal |       3 |   5 | -27.19 | 59.37 |      64.13 |          500 |                 1040 |     20.94 | michk_32/hk/1m/3.wav       |
| CAE v1.2                   | 3m         | Normal |       7 |   6 | -20.54 | 38.94 |      58.76 |          470 |                 1030 |     20.56 | micanhthao/micmoi/3m/7.wav |
| HK mic 4 array linear v3.2 | 3m         | Normal |       3 |   4 | -26.86 | 54.53 |      72.68 |          520 |                  520 |     15.41 | michk_32/hk/3m/3.wav       |

## 12. Technical and Operational Assessment

### HK mic 4 array linear v3.2

- No echo: usable capture quality at 1 m, 2 m, and 3 m.
- Echo: less information loss than CAE, although mild distortion is still expected when AEC / echo suppression is active.
- Deployment: compact, fewer external accessories, and built-in tuning software, making it more suitable for repeatable robot integration.

### CAE v1.2

- No echo: usable capture quality, matching the subjective observation that both microphones are acceptable without robot-speaker playback.
- Echo: higher risk of speech suppression / information loss in some files. In this dataset, no file is fully muted by the absolute-energy threshold, but multiple CAE echo files show loudness drops, low active ratio, abnormal speech-band energy, or low correlation with the OBS reference; these are listed in the echo risk table.
- Deployment: requires an external sound card and additional wiring/hardware, increasing integration complexity.

## 13. Generated Figures

- `figures/snr_hk_vs_cae.png`: SNR by distance and mode.
- `figures/lufs_hk_vs_cae.png`: LUFS by distance and mode.
- `figures/dropout_hk_vs_cae.png`: dropout score by distance and mode.
- `figures/leading_silence_hk_vs_cae.png`: leading audio loss by distance and mode.
- `figures/pair_rms_delta_mic_obs.png`: real microphone RMS delta against laptop reference.
- `figures/pair_active_delta_mic_obs.png`: real microphone active-ratio delta against laptop reference.
- `figures/pair_envelope_corr_mic_obs.png`: real microphone/laptop envelope correlation.
- `figures/echo_wave_energy/*.png`: waveform and RMS energy envelope for each echo pair.

## 14. Per-File Real-Microphone Details

| Mic                        | Distance   | Mode   |   Trial |    s |   LUFS |    RMS |   Peak |   SNR |   Active % |   Leading ms |   Longest silence ms |   Dropout |   Speech % |   Echo idx |   Clip % | Audio path                       |
|:---------------------------|:-----------|:-------|--------:|-----:|-------:|-------:|-------:|------:|-----------:|-------------:|---------------------:|----------:|-----------:|-----------:|---------:|:---------------------------------|
| HK mic 4 array linear v3.2 | 1m         | Normal |       1 |  6   | -26.17 | -26.48 |  -9.29 | 60.41 |      68.78 |          540 |                 1210 |     19.33 |      58.68 |       0.9  |        0 | michk_32/hk/1m/1.wav             |
| HK mic 4 array linear v3.2 | 1m         | Normal |       2 |  4   | -26.64 | -26.8  |  -8.45 | 41.35 |      77.19 |            0 |                  780 |     13.68 |      62.38 |       0.9  |        0 | michk_32/hk/1m/2.wav             |
| HK mic 4 array linear v3.2 | 1m         | Normal |       3 |  5   | -27.19 | -28.01 | -10.04 | 59.37 |      64.13 |          500 |                 1040 |     20.94 |      59.85 |       0.89 |        0 | michk_32/hk/1m/3.wav             |
| HK mic 4 array linear v3.2 | 1m         | Normal |       4 |  3   | -27.01 | -27.55 | -10.03 | 39.91 |      74.58 |           50 |                  530 |     13.68 |      75.2  |       0.77 |        0 | michk_32/hk/1m/4.wav             |
| HK mic 4 array linear v3.2 | 1m         | Normal |       5 |  4   | -27.77 | -30.13 |  -7.82 | 59.58 |      44.61 |         1470 |                 1470 |     38.51 |      83.53 |       0.88 |        0 | michk_32/hk/1m/5.wav             |
| CAE v1.2                   | 1m         | Normal |       1 |  5   | -19.24 | -21.18 |  -1.81 | 41.01 |      49.1  |          750 |                  950 |     26.36 |      45.97 |       0.92 |        0 | micanhthao/micmoi/1m/1.wav       |
| CAE v1.2                   | 1m         | Normal |       2 |  4   | -20.59 | -22.45 |  -1.83 | 39.99 |      47.12 |          600 |                  620 |     24.76 |      56.36 |       0.92 |        0 | micanhthao/micmoi/1m/2.wav       |
| CAE v1.2                   | 1m         | Normal |       3 |  4   | -19.93 | -21.07 |  -5.34 | 39.76 |      61.15 |          180 |                  780 |     19.32 |      64.13 |       0.91 |        0 | micanhthao/micmoi/1m/3.wav       |
| CAE v1.2                   | 1m         | Normal |       4 |  3   | -19.94 | -21.86 |  -4.55 | 39.94 |      49.5  |          610 |                  710 |     28.31 |      56.2  |       0.91 |        0 | micanhthao/micmoi/1m/4.wav       |
| CAE v1.2                   | 1m         | Normal |       5 |  4   | -20.11 | -20.62 |  -3.9  | 39.61 |      69.92 |            0 |                  820 |     16.49 |      50.83 |       0.91 |        0 | micanhthao/micmoi/1m/5.wav       |
| CAE v1.2                   | 1m         | Normal |       6 |  4   | -20.54 | -22.07 |  -4.55 | 39.72 |      55.14 |          340 |                  820 |     22.84 |      57.95 |       0.91 |        0 | micanhthao/micmoi/1m/6.wav       |
| CAE v1.2                   | 1m         | Normal |       7 |  3   | -20.16 | -20.3  |  -5.34 | 38.19 |      73.91 |          380 |                  380 |     13.19 |      38.66 |       0.84 |        0 | micanhthao/micmoi/1m/7.wav       |
| CAE v1.2                   | 1m         | Normal |       8 |  3   | -20.21 | -20.1  |  -4.65 | 35.98 |      80.6  |            0 |                  160 |      6.67 |      52.75 |       0.86 |        0 | micanhthao/micmoi/1m/8.wav       |
| CAE v1.2                   | 1m         | Normal |       9 |  5   | -20.18 | -21.74 |  -4.34 | 39.8  |      57.11 |          360 |                  690 |     19.61 |      60.04 |       0.9  |        0 | micanhthao/micmoi/1m/9.wav       |
| CAE v1.2                   | 1m         | Normal |      10 |  4   | -20.23 | -21.69 |  -4.52 | 39.35 |      60.15 |           40 |                 1230 |     23.27 |      41.98 |       0.89 |        0 | micanhthao/micmoi/1m/10.wav      |
| HK mic 4 array linear v3.2 | 2m         | Normal |       1 |  4   | -27.87 | -28.61 | -12.77 | 57.69 |      67.92 |          200 |                  950 |     19.69 |      73.42 |       0.87 |        0 | michk_32/hk/2m/1.wav             |
| HK mic 4 array linear v3.2 | 2m         | Normal |       2 |  4   | -28.1  | -28.66 | -11.23 | 58.14 |      68.67 |           40 |                 1010 |     18.97 |      80.25 |       0.85 |        0 | michk_32/hk/2m/2.wav             |
| HK mic 4 array linear v3.2 | 2m         | Normal |       3 |  4   | -27.41 | -28.36 |  -8.35 | 59.13 |      57.89 |          240 |                 1050 |     22.69 |      79.79 |       0.85 |        0 | michk_32/hk/2m/3.wav             |
| HK mic 4 array linear v3.2 | 2m         | Normal |       4 |  5   | -26.55 | -26.66 |  -9.84 | 43.33 |      75.55 |           40 |                  960 |     14.19 |      90.43 |       0.81 |        0 | michk_32/hk/2m/4.wav             |
| HK mic 4 array linear v3.2 | 2m         | Normal |       5 |  3   | -31.23 | -31.94 | -10.41 | 55.15 |      54.85 |            0 |                 1150 |     27.63 |      78.8  |       0.89 |        0 | michk_32/hk/2m/5.wav             |
| CAE v1.2                   | 2m         | Normal |       1 |  4   | -20.65 | -22.49 |  -2.22 | 39.23 |      53.88 |          280 |                  610 |     20.79 |      57.14 |       0.9  |        0 | micanhthao/micmoi/2m/1.wav       |
| CAE v1.2                   | 2m         | Normal |       2 |  4   | -19.65 | -20.92 |  -4.71 | 40.31 |      59.4  |          600 |                  690 |     21.25 |      71.3  |       0.89 |        0 | micanhthao/micmoi/2m/2.wav       |
| CAE v1.2                   | 2m         | Normal |       3 |  4   | -18.91 | -20.54 |  -2.78 | 41.12 |      52.63 |          190 |                 1310 |     27.36 |      58    |       0.9  |        0 | micanhthao/micmoi/2m/3.wav       |
| CAE v1.2                   | 2m         | Normal |       4 |  4.5 | -21.34 | -22.99 |  -5.06 | 38.26 |      56.79 |            0 |                 1300 |     24.01 |      47.78 |       0.9  |        0 | micanhthao/micmoi/2m/4.wav       |
| CAE v1.2                   | 2m         | Normal |       5 |  4   | -19.7  | -20.12 |  -4.05 | 39.14 |      75.69 |          340 |                  410 |     12.93 |      52.63 |       0.77 |        0 | micanhthao/micmoi/2m/5.wav       |
| CAE v1.2                   | 2m         | Normal |       6 |  4.5 | -20.54 | -20.73 |  -3.07 | 36.75 |      78.84 |           90 |                  490 |      9.68 |      55.19 |       0.76 |        0 | micanhthao/micmoi/2m/6.wav       |
| CAE v1.2                   | 2m         | Normal |       7 |  4   | -19.77 | -20.68 |  -2.67 | 39.88 |      64.41 |          340 |                  730 |     18.37 |      53.24 |       0.84 |        0 | micanhthao/micmoi/2m/7.wav       |
| CAE v1.2                   | 2m         | Normal |       8 |  4   | -20.52 | -22.21 |  -3.3  | 39.56 |      53.88 |          150 |                  540 |     18.38 |      60.62 |       0.88 |        0 | micanhthao/micmoi/2m/8.wav       |
| CAE v1.2                   | 2m         | Normal |       9 |  4   | -20.59 | -21.44 |  -4.68 | 38.47 |      67.67 |          310 |                  620 |     16.47 |      77.2  |       0.83 |        0 | micanhthao/micmoi/2m/9.wav       |
| CAE v1.2                   | 2m         | Normal |      10 |  5   | -22.81 | -23.68 |  -7.06 | 35.51 |      71.34 |            0 |                  280 |      9.26 |      66.53 |       0.78 |        0 | micanhthao/micmoi/2m/10.wav      |
| HK mic 4 array linear v3.2 | 3m         | Normal |       1 |  4   | -27.09 | -27.71 | -10.64 | 57.8  |      73.43 |            0 |                  930 |     16.4  |      86.81 |       0.91 |        0 | michk_32/hk/3m/1.wav             |
| HK mic 4 array linear v3.2 | 3m         | Normal |       2 |  4   | -28.17 | -28.73 | -12.17 | 58.66 |      68.92 |          220 |                  900 |     18.82 |      89.04 |       0.84 |        0 | michk_32/hk/3m/2.wav             |
| HK mic 4 array linear v3.2 | 3m         | Normal |       3 |  4   | -26.86 | -27.27 |  -9.36 | 54.53 |      72.68 |          520 |                  520 |     15.41 |      93.71 |       0.81 |        0 | michk_32/hk/3m/3.wav             |
| HK mic 4 array linear v3.2 | 3m         | Normal |       4 |  5   | -27.2  | -27.85 | -10.54 | 59.51 |      63.93 |          540 |                  920 |     19.49 |      93.4  |       0.86 |        0 | michk_32/hk/3m/4.wav             |
| HK mic 4 array linear v3.2 | 3m         | Normal |       5 |  4   | -28.67 | -29.55 | -13    | 57.02 |      67.92 |            0 |                 1160 |     20.44 |      92.9  |       0.9  |        0 | michk_32/hk/3m/5.wav             |
| CAE v1.2                   | 3m         | Normal |       1 |  3.5 | -20.33 | -21.6  |  -4.02 | 38.76 |      63.04 |          310 |                  590 |     17.42 |      43.15 |       0.87 |        0 | micanhthao/micmoi/3m/1.wav       |
| CAE v1.2                   | 3m         | Normal |       2 |  3.5 | -19.34 | -21.1  |  -4.64 | 39.86 |      58.45 |            0 |                 1210 |     24.77 |      59.88 |       0.91 |        0 | micanhthao/micmoi/3m/2.wav       |
| CAE v1.2                   | 3m         | Normal |       3 |  5   | -19.94 | -21.85 |  -4.62 | 39.66 |      52.71 |          380 |                  510 |     19.34 |      41.81 |       0.92 |        0 | micanhthao/micmoi/3m/3.wav       |
| CAE v1.2                   | 3m         | Normal |       4 |  5   | -20.88 | -22.33 |  -2.83 | 38.61 |      61.72 |          120 |                  750 |     17.03 |      49.35 |       0.88 |        0 | micanhthao/micmoi/3m/4.wav       |
| CAE v1.2                   | 3m         | Normal |       5 |  3   | -18.49 | -19.43 |  -4.8  | 40.82 |      66.89 |            0 |                  340 |     12.41 |      52.22 |       0.87 |        0 | micanhthao/micmoi/3m/5.wav       |
| CAE v1.2                   | 3m         | Normal |       6 |  4   | -20.45 | -22.72 |  -5.59 | 39.66 |      48.37 |           80 |                 1710 |     32.42 |      50.15 |       0.9  |        0 | micanhthao/micmoi/3m/6.wav       |
| CAE v1.2                   | 3m         | Normal |       7 |  6   | -20.54 | -22.12 |  -2.44 | 38.94 |      58.76 |          470 |                 1030 |     20.56 |      53.76 |       0.91 |        0 | micanhthao/micmoi/3m/7.wav       |
| CAE v1.2                   | 3m         | Normal |       8 |  6   | -20.46 | -21.71 |  -4.44 | 39.2  |      62.44 |          680 |                  920 |     19.51 |      53.11 |       0.9  |        0 | micanhthao/micmoi/3m/8.wav       |
| CAE v1.2                   | 3m         | Normal |       9 |  6   | -22.2  | -24.36 |  -2.32 | 37.53 |      52.09 |          330 |                 1130 |     22.84 |      52.8  |       0.88 |        0 | micanhthao/micmoi/3m/9.wav       |
| CAE v1.2                   | 3m         | Normal |      10 |  7   | -20.54 | -21.86 |  -4.05 | 39.06 |      59.37 |            0 |                 1430 |     18.73 |      53.73 |       0.93 |        0 | micanhthao/micmoi/3m/10.wav      |
| HK mic 4 array linear v3.2 | 1m         | Echo   |       1 |  3   | -23.68 | -24.89 |  -2.43 | 36.37 |      74.25 |            0 |                  140 |      7.84 |      87.66 |       0.59 |        0 | michk_32/hk/1m/1_echo.wav        |
| HK mic 4 array linear v3.2 | 1m         | Echo   |       2 |  3   | -20.72 | -22.56 |  -2.35 | 59.9  |      64.88 |            0 |                  590 |     17.56 |      86.28 |       0.79 |        0 | michk_32/hk/1m/2_echo.wav        |
| HK mic 4 array linear v3.2 | 1m         | Echo   |       3 |  5   | -21.15 | -23.41 |  -2.17 | 37.66 |      72.34 |            0 |                  130 |      7.43 |      79.71 |       0.59 |        0 | michk_32/hk/1m/3_echo.wav        |
| HK mic 4 array linear v3.2 | 1m         | Echo   |       4 |  4   | -20.86 | -21.7  |  -2.86 | 31.65 |      85.21 |          160 |                  160 |      5.28 |      87.49 |       0.53 |        0 | michk_32/hk/1m/4_echo.wav        |
| HK mic 4 array linear v3.2 | 1m         | Echo   |       5 |  4   | -21.28 | -22.41 |  -3.1  | 39.71 |      68.92 |            0 |                  460 |     11.4  |      81.22 |       0.67 |        0 | michk_32/hk/1m/5_echo.wav        |
| CAE v1.2                   | 1m         | Echo   |       1 |  5   | -29.16 | -34.82 |  -9.5  | 29.02 |      37.07 |            0 |                  530 |     15.99 |      91.22 |       0.72 |        0 | micanhthao/micmoi/1m/1_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       2 |  6   | -22.76 | -25.77 |  -6.29 | 34.35 |      55.09 |            0 |                  180 |      7.89 |      93.44 |       0.79 |        0 | micanhthao/micmoi/1m/2_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       3 |  4   | -23.87 | -23.74 |  -7    | 20.91 |      99.25 |            0 |                    0 |      0    |      90.31 |       0.37 |        0 | micanhthao/micmoi/1m/3_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       4 |  6   | -21.73 | -23.2  |  -4.88 | 36.07 |      72.29 |            0 |                  630 |     10.34 |      93.95 |       0.84 |        0 | micanhthao/micmoi/1m/4_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       5 |  4   | -24.89 | -27.25 |  -5.22 | 32.85 |      63.16 |            0 |                  330 |     10.18 |      96.52 |       0.83 |        0 | micanhthao/micmoi/1m/5_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       6 |  4   | -29.99 | -32.27 |  -9.79 | 28.61 |      52.63 |          300 |                  300 |     14.4  |      98.36 |       0.79 |        0 | micanhthao/micmoi/1m/6_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       7 |  4   | -22.96 | -24.05 |  -6.85 | 28.61 |      86.72 |            0 |                  100 |      2.72 |      90.73 |       0.77 |        0 | micanhthao/micmoi/1m/7_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       8 |  3   | -20.92 | -25.21 |  -4.47 | 37.91 |      36.45 |           70 |                  390 |     18.85 |      96.39 |       0.87 |        0 | micanhthao/micmoi/1m/8_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |       9 |  3   | -22.51 | -24.1  |  -2.79 | 31.47 |      81.61 |          130 |                  130 |      5.43 |      91.36 |       0.77 |        0 | micanhthao/micmoi/1m/9_echo.wav  |
| CAE v1.2                   | 1m         | Echo   |      10 |  3   | -31.67 | -27.81 |  -4.62 | 36.06 |      39.13 |            0 |                  520 |     20.25 |       6.5  |       0.87 |        0 | micanhthao/micmoi/1m/10_echo.wav |
| CAE v1.2                   | 1m         | Echo   |      11 | 32   | -24.11 | -28.98 |  -3.44 | 35.25 |      38.04 |           90 |                 1280 |     14.94 |      92.24 |       0.81 |        0 | micanhthao/micmoi/1m/11_echo.wav |
| HK mic 4 array linear v3.2 | 2m         | Echo   |       1 |  4   | -20.94 | -22.29 |  -4.07 | 35.13 |      79.95 |          310 |                  310 |      8.66 |      94.7  |       0.68 |        0 | michk_32/hk/2m/1_echo.wav        |
| HK mic 4 array linear v3.2 | 2m         | Echo   |       2 |  4   | -25.89 | -26.26 |  -6.76 | 57.35 |      62.66 |            0 |                 1200 |     22.11 |      94.13 |       0.83 |        0 | michk_32/hk/2m/2_echo.wav        |
| HK mic 4 array linear v3.2 | 2m         | Echo   |       3 |  3   | -25.96 | -26.14 |  -5.96 | 26.21 |      85.62 |            0 |                   50 |      3.04 |      79.75 |       0.41 |        0 | michk_32/hk/2m/3_echo.wav        |
| HK mic 4 array linear v3.2 | 2m         | Echo   |       4 |  3   | -23.36 | -24.33 |  -4.91 | 29.85 |      82.94 |            0 |                   70 |      3.63 |      87.22 |       0.52 |        0 | michk_32/hk/2m/4_echo.wav        |
| HK mic 4 array linear v3.2 | 2m         | Echo   |       5 |  6   | -25.67 | -26.61 |  -3.58 | 39.85 |      69.28 |          250 |                  350 |     11.59 |      85.45 |       0.64 |        0 | michk_32/hk/2m/5_echo.wav        |
| CAE v1.2                   | 2m         | Echo   |       1 |  5   | -28.55 | -31.05 | -11.53 | 29.13 |      58.52 |            0 |                  130 |      7.58 |      86.47 |       0.82 |        0 | micanhthao/micmoi/2m/1_echo.wav  |
| CAE v1.2                   | 2m         | Echo   |       2 |  5   | -24.61 | -24.96 |  -4.76 | 31.74 |      78.36 |            0 |                  420 |      7.51 |      87.5  |       0.81 |        0 | micanhthao/micmoi/2m/2_echo.wav  |
| CAE v1.2                   | 2m         | Echo   |       3 |  4   | -22.52 | -24.11 |  -5.06 | 30.48 |      81.45 |            0 |                   80 |      3.07 |      87.52 |       0.81 |        0 | micanhthao/micmoi/2m/3_echo.wav  |
| CAE v1.2                   | 2m         | Echo   |       4 |  5   | -26.53 | -27.5  |  -7.59 | 27.79 |      80.96 |            0 |                   90 |      3.09 |      91.91 |       0.74 |        0 | micanhthao/micmoi/2m/4_echo.wav  |
| CAE v1.2                   | 2m         | Echo   |       5 |  4   | -22.4  | -23.94 |  -5.68 | 27.18 |      90.73 |            0 |                   40 |      0.88 |      95.8  |       0.67 |        0 | micanhthao/micmoi/2m/5_echo.wav  |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       1 |  4   | -25.75 | -26    |  -3.87 | 36.97 |      74.69 |            0 |                  320 |      9.56 |      93.97 |       0.59 |        0 | michk_32/hk/3m/1_echo.wav        |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       2 |  4   | -23.16 | -24.82 |  -4.08 | 36.26 |      71.43 |            0 |                  160 |      7.98 |      93.63 |       0.61 |        0 | michk_32/hk/3m/2_echo.wav        |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       3 |  4   | -25.62 | -28.15 |  -5.35 | 53.78 |      62.91 |            0 |                  960 |     20    |      90.92 |       0.86 |        0 | michk_32/hk/3m/3_echo.wav        |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       4 |  5   | -25.26 | -26.91 |  -4.16 | 57.57 |      58.92 |         1090 |                 1090 |     24.15 |      91.08 |       0.85 |        0 | michk_32/hk/3m/4_echo.wav        |
| HK mic 4 array linear v3.2 | 3m         | Echo   |       5 |  5   | -24.69 | -26.7  |  -8.09 | 51.94 |      63.53 |           50 |                  350 |     14.16 |      86.27 |       0.7  |        0 | michk_32/hk/3m/5_echo.wav        |
| CAE v1.2                   | 3m         | Echo   |       1 |  6   | -25.71 | -28.15 |  -6.22 | 31.37 |      60.43 |          190 |                  250 |      8.06 |      95.81 |       0.75 |        0 | micanhthao/micmoi/3m/1_echo.wav  |
| CAE v1.2                   | 3m         | Echo   |       2 |  4   | -21.84 | -23.72 |  -4.81 | 33.93 |      73.93 |          120 |                  290 |      7.97 |      96.39 |       0.8  |        0 | micanhthao/micmoi/3m/2_echo.wav  |
| CAE v1.2                   | 3m         | Echo   |       3 |  3   | -23.22 | -26.19 |  -7    | 33.99 |      58.86 |            0 |                  290 |     10.65 |      97.6  |       0.82 |        0 | micanhthao/micmoi/3m/3_echo.wav  |
| CAE v1.2                   | 3m         | Echo   |       4 |  2   | -25.9  | -28.71 | -11.46 | 31.67 |      62.81 |           10 |                  260 |     12.41 |      98.07 |       0.72 |        0 | micanhthao/micmoi/3m/4_echo.wav  |
| CAE v1.2                   | 3m         | Echo   |       5 |  4   | -22.51 | -26.3  |  -6.35 | 32.35 |      70.68 |            0 |                  330 |      8.16 |      97.87 |       0.77 |        0 | micanhthao/micmoi/3m/5_echo.wav  |

## 15. Limitations

This report measures the available recordings only. Because there is no separated far-end / speaker reference signal, true ERLE is not computed; echo and dropout metrics are heuristics for relative comparison under this test set.
