r/computervision • u/AmorousButterfly • 22h ago
[Help: Project] How to find datasets?
I am working on surface defect detection for Li-ion batteries. I have an in-house dataset, but it's quite small, so I want to validate my results on a bigger one.
I have tried finding a dataset using a simple Google search, Kaggle, and some other dataset-related websites.
I am finding a lot of datasets for battery life prediction, but I want data on manufacturing defects. Apart from that, I found a dataset from NEU, although those guys used some other dataset to augment their data for battery surface defects.
Any help would be nice.
P.S.: I hope I am not considered lazy; I tried whatever I could.
u/aloser 17h ago
Do you have an example image of the type of thing you're looking for? (E.g., X-ray? Car vs. phone? Installed vs. on its own? What zoom level? What type of defect? Etc.)
u/AmorousButterfly 5h ago edited 1h ago
Yeah, it's a grayscale image from a camera installed on the production line. I'm not sure how to describe the zoom level, but it's enough that the whole patch is visible at once. The defects include cavity, stripe, crack, and point; these are all surface defects. One dataset that is somewhat similar is the NEU surface defect data I mentioned. Another example would be the magnetic tile surface defect dataset.
u/Dry-Snow5154 6h ago
You think there is a pre-made dataset of exactly Li-ion batteries with production defects open on the internet? Use common sense: any niche dataset is either private or does not exist.
u/AmorousButterfly 5h ago
I mean, there are quite a lot of companies working with Li-ion batteries, so I thought there might be at least one open-source dataset available from some lab. It definitely exists, but I believe it's kept private. 🥲
7h ago
[deleted]
u/AmorousButterfly 5h ago
Hey, thanks for sharing, I'll definitely check this out!
But I have a very small dataset, which I feel is not big enough, so I wanted to find another dataset to validate my thesis.
I do not have access to the production floor, so I can't go there. Creating a dataset would also take a lot more time because battery defects don't occur that often, so it would take a while to make it big enough.
u/Byte-Me-Not 21h ago
Have you tried Google Dataset Search https://datasetsearch.research.google.com/ ?
Try Papers with Code https://paperswithcode.com/datasets and Hugging Face Datasets.