DefinedCrowd Works to Address AI Bias
DefinedCrowd is expanding its online data marketplace, DefinedData, to third-party suppliers to sell or share AI datasets and working with NVIDIA to provide dataset samples through the NVIDIA NGC catalog.
To address biases in artificial intelligence, DefinedData's catalog now offers detailed information on the gender, age, accent, and phonetic distribution of datasets as well as meta-data on the recordings, and audio samples.
DefinedCrowd will provide dataset samples through the NVIDIA NGC catalog, a hub for AI and HPC containers, pre-trained models and SDKs that simplifies and accelerates end-to-end workflows. Datasets can be used to train models using libraries within the NVIDIA Jarvis application framework; NVIDIA Transfer Learning Toolkit, which enables developers to build production-quality models faster with no coding required; as well as the NVIDIA NeMo platform, a Python toolkit for building, training, and fine-tuning unmatched GPU-accelerated conversational AI models.
"By working with DefinedCrowd, we're providing NVIDIA Jarvis and NeMo users with sample datasets to build and accelerate their models, all within the NGC environment," said Richard Kerris, head of developer relations at NVIDIA, in a statement.
"Companies constantly need to engage a long tail of data in order to grow in new sectors, and data scientists need the raw material in order to address these issues as data science becomes more democratic each day," said DefinedCrowd's director of machine learning, Christopher Shulby, in a statement. "This offering will allow data scientists to keep their models relevant in a continually evolving world."