Skip to main content

Curated list of resources for creating original datasets for original Data Science, Machine Learning and AI research and projects

0
GitHub Stars
50
Curated Resources
5
Categories
23 hours ago
Last Refreshed
TutorialsLibrariesAcademic PapersServicesDatasets

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me simulation resources from awesome-dataset-creation"

Installation instructions →

What's inside

Libraries

  • AirSimSimulation

    AirSim is a simulator for drones, cars and more, built on Unreal and Unity engines.

  • Contrastive Unpaired TranslationImage

    Contrastive unpaired image-to-image translation, faster and lighter training than cyclegan.

  • Denoising Diffusion PytorchImage

    Implementation of DDPM

  • gretel-syntheticsText, Tabular and Time-Series

    Generative models for structured and unstructured text, tabular, and multi-variate time-series data featuring differentially private learning.

  • JukeboxAudio

    OpenAI's Jukebox- A Generative Model for Music.

  • Nvidia Dataset SynthesizerSimulation

    NDDS is a UE4 plugin from NVIDIA to empower computer vision researchers to export high-quality synthetic images with metadata.

Tutorials

Datasets

Academic Papers

Services

Showing a sample of 50 resources. View the full list on GitHub →