This blog post from Grab Tech looks very promising. Actually, this is something that many people have been waiting for from Mapillary. Apparently, it takes people who are closer to manufacturing and to that market. Unfortunately, the blog post does not say anything about the availability or price of the camera. Nevertheless, things look impressive so far. The camera should even be capable of some AI on the edge. However, I do not agree with every assumption and design choice made by Grab Tech’s engineers:
- Privacy information detection
  Face and license plate blurring on the edge (albeit with AI) is a great feature, not only for mapping but also for publishing imagery in general in the digital age. However, every AI feature must have an override switch!
- Scene recognition model
- Image quality (IQ) checking AI model
- Object detection AI model
If anything, these last three should happen on the backend. A camera should not rate or qualify/disqualify the images you capture. Just think about all the false positives. Furthermore, there is a risk of creating a single-purpose camera. Devices in general, and digital devices especially, should not patronize users. They can and sometimes should warn users about potential dangers or potentially illegal actions, but users (people) should decide what to do and what not to do, not machines.
Cameras should not process or filter images too much either, be it with AI or static filters. When capturing a photo we should be interested in the physical truth, not some digitally glittered, processed output stream. We should want to record photons, as the name photography says, not produce some prettified pixels.
Their “quadcam” setup for panoramic images also looks interesting. Full spherical imagery is not really necessary for many purposes, including mapping. What is often more important than full 360° is ease of use, capture, and stitching.
I am also surprised that they deem a 12 MP sensor sufficient for street-view mapping, especially from vehicles. My computations have led me to believe that a 4896×3264 (~16 MP), 3:2 sensor with ~2.45 µm pixels and a linear lens of ~13 mm focal length is probably the ideal combo for this purpose. But hey, this is just my opinion based on my assumptions.
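For anyone who wants to check the arithmetic behind those numbers, here is a minimal Python sketch. It assumes square pixels, takes "linear" to mean a rectilinear lens focused at infinity, and uses the simple pinhole field-of-view formula. The function name `sensor_geometry` and the dictionary keys are only illustrative and do not come from Grab Tech's post.

```python
import math

def sensor_geometry(width_px, height_px, pixel_pitch_um, focal_length_mm):
    """Derive basic sensor figures from resolution, pixel pitch and focal length.

    Assumptions (mine, for illustration): square pixels and an ideal
    rectilinear lens, so FOV = 2 * atan(sensor_size / (2 * focal_length)).
    """
    megapixels = width_px * height_px / 1e6
    # Physical sensor size in mm (pixel pitch is given in micrometres).
    sensor_w_mm = width_px * pixel_pitch_um / 1000.0
    sensor_h_mm = height_px * pixel_pitch_um / 1000.0
    # Horizontal and vertical field of view in degrees.
    hfov_deg = math.degrees(2 * math.atan(sensor_w_mm / (2 * focal_length_mm)))
    vfov_deg = math.degrees(2 * math.atan(sensor_h_mm / (2 * focal_length_mm)))
    return {
        "megapixels": megapixels,
        "aspect_ratio": width_px / height_px,
        "sensor_w_mm": sensor_w_mm,
        "sensor_h_mm": sensor_h_mm,
        "hfov_deg": hfov_deg,
        "vfov_deg": vfov_deg,
    }

# The combo suggested above: 4896x3264 px, ~2.45 um pixels, ~13 mm lens.
geom = sensor_geometry(4896, 3264, 2.45, 13.0)
for name, value in geom.items():
    print(f"{name}: {value:.2f}")
```

With these inputs the sensor works out to roughly 12 mm × 8 mm, and the horizontal field of view lands just under 50°. Swapping in a 12 MP resolution makes it easy to compare the two setups under the same assumptions.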
Anyhow, I would be happy if Mapillary got in touch with Grab Tech and perhaps opted for some collaboration in this space.