当前位置:网站首页>Notes on Flickr's dataset
Notes on Flickr's dataset
2022-07-25 17:34:00 【Wsyoneself】
- flickr8k Image annotation dataset :
- Data set containing 8,000 Zhang image , Each image is paired with five different titles , These titles provide content descriptions of objects and events in the pictures
- This data set seems to be related to the image description task ( Generate a text description for the image ) of .
- Image subtitles generate excellent data sets that can be used :flickr8k Data sets , Realistic and relatively small .
- Flickr30K It's from Flickr The sorted out content downloaded from 30k Pictures and data sets corresponding to description sentences
- IGEODATA Data sets :
- The data set consists of ten bzip2 Compressed files (yfcc100m_dataset-0.bz2 To yfcc100m_dataset-9.bz2) form , Each file contains 10M That's ok , Each row contains the following tab delimited fields :* Photo / Video identifier 、* user NSID、* The user nickname 、* Date of shooting 、* Upload date 、* Capture devices 、* title 、* describe 、* user tags( Comma separated )、* machine tags( Comma separated )、* longitude 、* latitude 、* accuracy 、* Photo / Video page URL、* Photo / Video downloading URL、* License name 、* license URL、* Photo / Video server identifier 、* Photo / Video field identifier 、* Photo / Video confidentiality 、* Photo / Original confidential video 、* Expansion of the original photo 、* Photo / Video Tags (0= Photo ,1= video )
- The field containing free-form text has been URL code . Not all fields have values , Especially the camera 、 title 、 describe 、 Mark 、EXIF、 longitude 、 The latitude and precision fields may be empty . Please note that , The original extension is only meaningful for photos , It doesn't make sense for video ( Please check the first few bytes of the video to determine its file format ).
- In addition to dataset files , Also provided is a photo containing / Video identifier and its corresponding MD5 Hash (yfcc100m_hash.bz2) The file of . These hashes will be used for externally hosted expansion packs ( For example, function 、 notes ), As an indirect layer , To Hide Photos / Direct access to video information .
边栏推荐
猜你喜欢
![[knowledge atlas] practice -- Practice of question answering system based on medical knowledge atlas (Part3): rule-based problem classification](/img/4c/aeebbc9698f8d5c23ed6473c9aca34.png)
[knowledge atlas] practice -- Practice of question answering system based on medical knowledge atlas (Part3): rule-based problem classification

Idea 必备插件

The gas is exhausted! After 23 years of operation, the former "largest e-commerce website in China" has become yellow...

论文阅读_多任务学习_MMoE

STM32 PAJ7620U2手势识别模块(IIC通信)程序源码详解

Bo Yun container cloud and Devops platform won the trusted cloud "technology best practice Award"

I2C通信——时序图

对灰度图像的三维函数显示

「数字安全」警惕 NFT的七大骗局

带你初步了解多方安全计算(MPC)
随机推荐
Technical difficulties and applications of large humanoid robots
对灰度图像的三维函数显示
Multi tenant software development architecture
「数字安全」警惕 NFT的七大骗局
四六级
Redis源码与设计剖析 -- 17.Redis事件处理
【Cadence Allegro PCB设计】永久修改快捷键(自定义)~亲测有效~
【硬件工程师】DC-DC隔离式开关电源模块为什么会用到变压器?
ROS learning notes (IV) ROS cannot solve rosdep init or update
stm32F407------SPI
基于SqlSugar的开发框架循序渐进介绍(13)-- 基于ElementPlus的上传组件进行封装,便于项目使用
[Hardware Engineer] about signal level driving capability
Excel表格 / WPS表格中怎么在下拉滚动时让第一行标题固定住?
Is there a method in PostgreSQL that only compiles statements but does not execute them?
【硬件工程师】元器件选型都不会?
【Cadence Allegro PCB设计】error: Possible pin type conflict GND/VCC Power Connected to Output
The gas is exhausted! After 23 years of operation, the former "largest e-commerce website in China" has become yellow...
ACL 2022 | comparative learning based on optimal transmission to achieve interpretable semantic text similarity
【硬件工程师】关于信号电平驱动能力
第三章、数据类型和变量