We collect images and annotate them with medical imaging specialists. In particular, COVID-19 include 1,517 NORMAL cases, 1,467 PNEUMONIA cases, 439 cases. In addition, images are gathered from sources: COVID-19 Radiography Database, Covid-19 Image Dataset, COVID-19 PatientsLungs X Ray Images 10000, COVID-19 High quality images.
UIT-DODV is the first Vietnamese document image dataset, including 2,394 images with four classes: Table, Figure, Caption, Formula. UIT-DODV converted 1,696 images from PDF with size 1,654 x 2,338, 247 images scanned from the physical scanner and expanded with 451 images scanned from the smartphone.
UIT-DODV has the following highlights:
Vietnam’s ethnic minority costumes gradually receive more attention through images which performed by KOLs or in music videos. The need of searching national costumes is increasing day by day. However, it is quite difficult to collect data on images of ethnic minority costumes on the internet. Therefore, we built a dataset about Vietnamese Ethnic Minority Costumes Classification (UIT-VEMC21), consisting of these 10 most largest ethnic groups: Tay, Thai, Muong, H’Mong, Dao, Ede, Ba Na, Cham, San Diu, Ra Glai.
Our dataset consists of 10 categories: Tay, Thai, Muong, H'Mong, Dao, Ede, Ba Na, Cham, San Diu, Ra Glai, which are carefully checked and labeled by certain sources. Samples are collected from 10 volunteers who wearing and changing 2-3 outfits. These photos were taken in clear and cloudy weather during the day.
UIT-VinaFruit20 contains 63,541 images are corresponding to 20 Vietnam popular fruits, with the average images per class are about 3,700 images. UIT-VinaFruit20 is the first Vietnamese fruit dataset with four main features promising more challenges.
UIT-VinaFruit20 has the following highlights:
With the current situation of traffic in Vietnam, we are facing many outstanding problems, especially traffic congestion since the supply of infrastructures has often not been able to keep up with mobility growth. A large number of CCTV, radar sensors are installed to monitor vehicles and collect traffic information, which helps agencies to keep track of traffic flow, vehicle density and parking status. However, these methods do not provide a sufficient overview to develop and solve the current situations.
Recently, images taken from UAVs have been easy to collect and extremely useful due to their ability of covering large areas in a single image and high resolution in a small number of locations. Thanks to this high resolution, vehicles can be detected even small objects such as cars and motors. But, to be able to solve the problem of vehicle detection well, we have to have a good classification model which can deal with the current situation of traffic in Vietnam. So that, our team built a dataset named UIT-CVID21 (Classifying Vehicle In Image From Drone) which can reflect the reality of Vietnam traffic to create premises for later studies and address problems such as traffic density management, traffic separation and traffic congestion.
UIT-CVID21 has 10,000 images which include four classes: bus, car, truck and van. This is one of the first dataset in this present time that has captured the most diverse angles and clearly show the characteristics of Vietnamese road thanks to the flexibility of unmanned aerial equipment (Drone).
Thanks to the advantage of high mobility, Unmanned Aerial Vehicles (UAVs) are used to provide many essential tasks in computer vision, bringing more efficiency and convenience than fixed surveillance cameras or ground moving sensors with limited angle and visibility. UAVs have many practical applications. However, drone datasets are still limited, and they focus only on a few specific tasks, such as visual tracking or object detection in certain situations.
In this project, we built a novel dataset – UIT-Drone21, to advance drone-based image analysis tasks with complex scenarios that promise new challenges. Our dataset was chosen from 23 short videos of approximately 13,066 fully labeled frames with bounding boxes for many tasks such as object detection, sing-object tracking, and multiple object tracking. We added approximately 2,304 frames (about 15,370 frames total) for object detection tasks.
UIT-Flower dataset includes 81,909 images for 21 flowers, which promises many challenges in building classification models in 4 perspectives:
Vietnamese cuisine encompasses diverse dishes from the mainstream culinary traditions in all three regions of Vietnam to the street food with original and creative recipes. We conduct a small survey on the Internet to choose the most favorite street food and dishes in typical traditional southern Vietnamese meals.
VinaFood21 contains 13,950 images are corresponding to 21 dishes. This is a more comprehensive food dataset that surpasses existing Vietnamese Food datasets from the following three aspects.
The UIT-Anomaly dataset includes a total of 224 muted videos captured at a frame rate of 30 fps with various resolutions. It has 104 normal and 120 anomalous videos. The total duration is more than 200 minutes, corresponding to 392,188 frames. We divide these videos into two subsets: the training set included 90 abnormal and 90 normal videos, while the test set consisted of the remaining 30 abnormal and the remaining 14 normal videos. Both training and test sets contain six classes of anomalies.
UIT-Anomaly has the following highlights:
The VNAnomaly dataset consists of 89 training videos and 96 evaluating videos that include real-world anomalies in Vietnamese street. The anomaly types contain 4 common human-related anomalies in the street of Vietnam including fighting, assault, vandalism, and robbery. Because the VNAnomaly is an unsupervised dataset, we will not explicitly define these anomaly types. The reason we choose the above anomaly types is the popularity of these types compared to other ones. In addition, these anomaly types are also relevant to the safety of public lives and assets in Vietnam’s urban environments. However, there are some unusual events that are not mentioned such as traffic accidents. Therefore, we will continue to provide more anomaly types soon. Our dataset surpasses existing unsupervised datasets from the following three aspects.
VNAnomaly has the following highlights:
We present a dataset with 18 dishes typically found during the Vietnamese traditional Tet holiday. There are a total of 77,000 images divided into two definite sets with a ratio of 8:2. The training set consists of 63,840 images, whereas the test set has 14,041 images. The ratio of each food’s images in two subsets is randomly divided. Real-world data goes hand in hand with challenges; this means that the accuracy and performance of a deep learning model or machine learning model will be heavily influenced by the obstacle that the training data presents. By raising these concerns, we hope to lay the groundwork for optimal solutions proposed in the future:
UIT-VinaDeveS22 has the following highlights:
UIT-DODV-Ext is the largest dataset of Vietnamese document images with 5,000 images for three objects on the page: Table, Figure, Caption. Document images of the UIT-DODV-Ext dataset are Vietnamese publications such as scientific papers and textbooks in PDF, which were scanned and converted to be stored as images. UIT-DODV-Ext promises to pose many challenges from four main factors: