
            <meta charset='UTF-8'>
            <style type='text/css'>
                table {border:1px solid #555;}
                td {border:1px solid #999;}
            </style> <strong>Table 2: Comparison with audio-driven facial animation methods. The best LVE value is marked in bold. “<inline-formula><mml:math id="m18"> <mml:mrow> <mml:msubsup> <mml:mi>z</mml:mi> <mml:mrow> <mml:mtext>FACS</mml:mtext> </mml:mrow> <mml:mrow> <mml:mtext>GT</mml:mtext> </mml:mrow> </mml:msubsup> </mml:mrow> </mml:math></inline-formula> +NFR” indicates the upper bound of the performance of our method.</strong><br><br><table><thead> <tr> <th valign="top" align="left">Method</th> <th valign="top" align="center">Mesh-agnostic</th> <th valign="top" align="center">LVE ↓ (×10<sup>−3</sup>)</th> </tr> </thead> <tbody> <tr> <td valign="top" align="left">CodeTalker</td> <td valign="top" align="center">✗</td> <td valign="top" align="center">1.5927 ± 0.8608</td> </tr> <tr> <td valign="top" align="left">Faceformer</td> <td valign="top" align="center">✗</td> <td valign="top" align="center">1.4854 ± 0.9858</td> </tr> <tr> <td valign="top" align="left">Ours</td> <td valign="top" align="center">✓</td> <td valign="top" align="center"><bold>1.1776</bold> ± 0.7796</td> </tr> <tr> <td valign="top" align="left"><inline-formula><mml:math id="m19"> <mml:mrow> <mml:msubsup> <mml:mi>z</mml:mi> <mml:mrow> <mml:mtext>FACS</mml:mtext> </mml:mrow> <mml:mrow> <mml:mtext>GT</mml:mtext> </mml:mrow> </mml:msubsup> </mml:mrow> </mml:math></inline-formula> + NFR</td> <td valign="top" align="center">-</td> <td valign="top" align="center">1.0642 ± 0.7425</td> </tr> </tbody></table>