five

Free Sample Dataset - 1000 High Resolution Images & Metadata|图像数据数据集|机器学习数据集

收藏
Databricks2024-05-09 收录
图像数据
机器学习
下载链接:
https://marketplace.databricks.com/details/3da47c97-478c-4935-a0e7-c61502c7c7b7/Shutterstock_Free-Sample-Dataset---1000-High-Resolution-Images-&-Metadata
下载链接
链接失效反馈
资源简介:
**Overview** This is a free sample dataset consisting of 1000 images and accompanying metadata sourced from our +550 million image library. Image types for this sample include photos, vectors, and illustrations across a vast range of content categories and settings. This sample includes a wide range of metadata fields including content descriptions and keywords that make it ideal for powering a wide variety of machine learning use cases. If you’d like to start licensing data from the full range of imagery and metadata available at Shutterstock please reach out directly to our team at sales.databricks@shutterstock.com to start using our tailored services to help ideate, curate and customize datasets for your unique business needs. **Use cases** This type of data can be licensed from Shutterstock for a wide variety of use cases including powering machine learning models that have generative capabilities. **Metadata** Sample metadata fields included in this dataset are listed below, for a full list of all metadata available from Shutterstock please contact our team at sales.databricks@shutterstock.com. **asset metadata:** id keywords image_type is_creative mature_flag date_submitted date_captured asset_location popularity_score german_description spanish_description french_description korean_description japanese_description labels moderation_labels has_model_release has_people primary_category
 **file metadata:** asset_id asset_file_size asset_file_extension asset_file_size_in_bytes width height orientation
 **model metadata:** asset_id model_release_id age_range age_in_years gender ethnicity **Our data** Shutterstock offers the largest, highest quality and most diverse collection of creative content with best-in-class metadata, giving technology businesses the scale and accuracy they need to build and sustain a wide variety of machine learning models. Our growing library of +550M images, +40M videos, +4M music and audio tracks, and +1.2M 3D models and data is human-reviewed for accuracy and IP infringement, allowing you to use our data worry-free and avoid unwanted or unlawful content. We ethically source all content from over 2 million creators in +150 countries and with over 60 million new assets added annually, our ever-growing library gives you access to fresh and diverse datasets that can be refreshed regularly to meet all your data needs.
提供机构:
Shutterstock
用户留言
有没有相关的论文或文献参考?
这个数据集是基于什么背景创建的?
数据集的作者是谁?
能帮我联系到这个数据集的作者吗?
这个数据集如何下载?
点击留言
数据主题
具身智能
数据集  4098个
机构  8个
大模型
数据集  439个
机构  10个
无人机
数据集  37个
机构  6个
指令微调
数据集  36个
机构  6个
蛋白质结构
数据集  50个
机构  8个
空间智能
数据集  21个
机构  5个
5,000+
优质数据集
54 个
任务类型
进入经典数据集
热门数据集

Google Scholar

Google Scholar是一个学术搜索引擎,旨在检索学术文献、论文、书籍、摘要和文章等。它涵盖了广泛的学科领域,包括自然科学、社会科学、艺术和人文学科。用户可以通过关键词搜索、作者姓名、出版物名称等方式查找相关学术资源。

scholar.google.com 收录

Asteroids by the Minor Planet Center

包含所有已知小行星的轨道数据和观测数据。数据来源于Minor Planet Center,格式包括Fortran (.DAT)和JSON,数据集大小为81MB(压缩)和450MB(未压缩),记录数约750,000条,每日更新。

github 收录

UniProt

UniProt(Universal Protein Resource)是全球公认的蛋白质序列与功能信息权威数据库,由欧洲生物信息学研究所(EBI)、瑞士生物信息学研究所(SIB)和美国蛋白质信息资源中心(PIR)联合运营。该数据库以其广度和深度兼备的蛋白质信息资源闻名,整合了实验验证的高质量数据与大规模预测的自动注释内容,涵盖从分子序列、结构到功能的全面信息。UniProt核心包括注释详尽的UniProtKB知识库(分为人工校验的Swiss-Prot和自动生成的TrEMBL),以及支持高效序列聚类分析的UniRef和全局蛋白质序列归档的UniParc。其卓越的数据质量和多样化的检索工具,为基础研究和药物研发提供了无可替代的支持,成为生物学研究中不可或缺的资源。

www.uniprot.org 收录

中国裁判文书网

中国裁判文书网是中国最高人民法院设立的官方网站,旨在公开各级法院的裁判文书。该数据集包含了大量的法律文书,如判决书、裁定书、调解书等,涵盖了民事、刑事、行政、知识产权等多个法律领域。

wenshu.court.gov.cn 收录

VoxBox

VoxBox是一个大规模语音语料库,由多样化的开源数据集构建而成,用于训练文本到语音(TTS)系统。

github 收录