A new collection of 124 million unique passwords from hundreds of millions of malware stealer log records has been confirmed ...
A collection of 114,000 music tracks ripped from Spotify. The data set was assembled by an unknown AI developer on Hugging ...
The dataset, which the researchers have made available on the Open Reaction Database, is nearly five times as large as the ...
We developed a DICOM dataset that can be used to evaluate the performance of de-identification algorithms. DICOM objects (a total of 1,693 CT, MRI, PET, and digital X-ray images) were selected from ...
Speech AI datasets look interchangeable until production exposes gaps in transcripts, speakers, audio conditions, licenses, ...
Which dataset features affect machine learning (ML) performance has remained largely unknown in the current literature. This study examines the impact of tabular datasets' different meta-level and ...
While many global road maps exist, few include detailed surface information or keep pace with rapid infrastructure change. The new HeiGIT dataset closes this gap by combining 3–4 meter resolution ...
MISMO updated its PaVS procurement dataset to standardize valuation orders, support UAD 3.6, and cut proprietary integrations ...