Developers now need to be careful with job offers. Criminals are trying to distribute infostealers through them.
SD-Eval is a benchmark dataset aimed at multidimensional evaluation of spoken dialogue understanding and generation. SD-Eval focuses on paralinguistic and environmental information and includes 7,303 ...
We aim to evaluate Large Language Models (LLMs) for embodied decision-making. While many works leverage LLMs for decision-making in embodied environments, a systematic understanding of their ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results