Eval JavaScript - Search News

17h

Malware scam: Job offers trick developers with malicious repositories

Developers now need to be careful with job offers. Criminals are trying to distribute infostealers through them.

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

SD-Eval is a benchmark dataset aimed at multidimensional evaluation of spoken dialogue understanding and generation. SD-Eval focuses on paralinguistic and environmental information and includes 7,303 ...

GitHub

Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making

We aim to evaluate Large Language Models (LLMs) for embodied decision-making. While many works leverage LLMs for decision-making in embodied environments, a systematic understanding of their ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Malware scam: Job offers trick developers with malicious repositories

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making

Trending now