Rice is not Ice Cream: the Philippines

Image recognition (IR) systems often perform poorly once in the real world. In this post, I test four of the most popular IR systems on original real world images of food from around the world, this time from the Philippines.

Key takeaway

Overall, the systems performed very poorly. Not a single item was correctly detected and severe cultural misrepresentations were made. Vision was able to pick up on the visual characteristics of the BBQ, but unfortunately ascribed it to many different cultures and countries except for the Philippines.

Correctly predicted images 0/3
Correctly detected items 0/9
Correct labels 4/104
Potentially harmful detections/labels
35
The above table includes only detections and labels of 80%+ confidence level, for lower confidence levels see the tables further below.

Insights

Object Detection

The object detection features failed to identify a single item across all three images. Descriptions that were given, were very general (e.g. “Food”). Also, Rekognition identified rice as “Ice Cream” across all three images, while Vision also identified the rice as “Ice Cream” and “Dessert”. Many of the items remained undetected.

Labeling

The BBQ is perhaps one of the most visually distinctive item in the three images. Vision clearly recognized this distinctiveness as it consistently labeled skewered BBQ like dishes such as Sate Kambing (Indonesian mutton satay), Shashlik (Caucasus/Central Asian skewered meat), Yakatori (Japanese skewered pork or chicken), and many more.

Unfortunately, none of these labels referred to the Philippines, which makes us question if any cultural misrepresentation is at play here (BBQ is a very common dish in the Philippines). It also makes us question as to in how far it is possible to distinguish visually very similar dishes between cultures based on an image without context.

While in previous analyses rice seemed somewhat easy to detect, Watson and Rekognition mistook rice as “Ice Cream” across multiple images. Vision also made this error (once as “Ice Cream” and another time as “Dessert”), though it did also label it as “Rice”. Here, again, it makes us wonder if any cultural misrepresentation is at play. Though one can imagine the visual similarities between the round shaped presentation of the rice (which is common in many Asian countries) in the images and ice cream, one can not help but feel ice cream was simply more represented in the training data.

As always, many labels were also too general or irrelevant.

My recommendation

  • Provide more specific and relevant labels for “BBQ” and “[Cup of] BBQ Sauce”;
  • Fix (cultural) misrepresentations (i.e. rice is not ice cream);
  • Understand the limits of how well a IR system could distinguish between similar dishes of different countries without further context or input, and what consequences this limit could have.

Results

Three images of one meal from the Philippines were available:

  • Meal 1: White rice and BBQ (dinner)

Object detection results.

GROUND TRUTH MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
White Rice Undetected Dessert (0.57) Ice Cream (0.96) /
BBQ Undetected Tableware (0.51) Undetected /
Cup of BBQ Sauce Bowl (0.61) Tableware (0.88) Undetected /

*Green = the right prediction; Yellow= the right prediction, but too general; Red = potentially harmful prediction; White = largely not relevant

Labeling Results:

MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
food (0.98) Food (0.98) Ice Cream (0.96)
Ice Cream or Frozen Yoghurt (0.9)
person (0.94) Tableware (0.96) Dessert (0.96) dessert (0.9)
cuisine (0.93) Ingredient (0.91) Cream (0.96) nutrition (0.9)
snack (0.91) Suya (0.9) Creme (0.96) food (0.9)
fast food (0.9) Dishware (0.89) Food (0.96) Ice Cream Parlor (0.8)
dish (0.89) Plate (0.88) Meal (0.92) shop (0.8)
dairy (0.85) Shashlik (0.88) Person (0.88) retail store (0.8)
indoor (0.82) Recipe (0.87) Human (0.88) building (0.8)
table (0.78) Anticuchos (0.86) Bowl (0.85) chocolate color (0.61)
  Cuisine (0.85) Dish (0.8) dark red color (0.57)
  Dish (0.84) Dish (0.8)  
  Brochette (0.84)    
  Sate kambing (0.83)    
  Fried food (0.75)    
  Beef (0.72)    
  Meat (0.72)    
  Cooking (0.72)    
  Produce (0.72)    
  Churrasco food (0.71)    
  Bowl (0.7)    
  Platter (0.69)    
  Fork (0.69)    
  Buffalo wing (0.67)    
  Finger food (0.66)    
  Comfort food (0.65)    

Object detection results.

GROUND TRUTH MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
Rice Undetected Ice Cream (0.69) Ice Cream (0.94) /
BBQ Undetected Food (0.71) Undetected /
Cup of BBQ Sauce Bowl (0.53) Tableware (0.84) Undetected /

*Green = the right prediction; Yellow= the right prediction, but too general; Red = potentially harmful prediction; White = largely not relevant

Labeling Results:

MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
food (0.99) Food (0.98) Ice Cream (0.94)
Ice Cream or Frozen Yogurt (0.89)
cuisine (0.92) Tableware (0.95) Dessert (0.94) dessert (0.89)
dairy (0.91) White rice (0.93) Cream (0.94) nutrition (0.89)
ice cream (0.89) Ingredient (0.9) Creme (0.94) food (0.89)
person (0.62) Recipe (0.88) Food (0.94)
alizarine red color (0.84)
  Plate (0.87) Meal (0.89)
Ice Cream Parlor (0.62)
  Jasmine rice (0.87) Plant (0.77) shop (0.62)
  Staple food (0.85) Dish (0.73) retail store (0.62)
  Sate kambing (0.84) Outdoors (0.59) building (0.62)
  Shashlik (0.84)    
  Rice (0.84)    
  Brochette (0.83)    
  Glutinous rice (0.82)    
  Anticuchos (0.81)    
  Fork (0.81)    
  Suya (0.8)    
  Dish (0.78)    
  Nasi lemak (0.77)    
  Steamed rice (0.76)    
  Produce (0.76)    
  Basmati (0.75)    
  Cuisine (0.73)    
  Meat (0.72)    
  Fried food (0.71)    
  Comfort food (0.7)    

Object Detection Results:

GROUND TRUTH MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
Rice Undetected Food (0.6) Ice Cream (0.81) /
BBQ Undetected Undetected Undetected /
Cup of BBQ Sauce Bowl (0.66) Bowl (0.88) Undetected /

Labeling Results:

MICROSOFT AZURE GOOGLE VISION AMAZON REKOG. IBM WATSON
snack (0.9) Food (0.99) Person (0.98) building (0.82)
dairy (0.89) Brochette (0.95) Human (0.98) food (0.8)
fast food (0.88) Suya (0.94) Ice Cream (0.81) shop (0.73)
person (0.84) Tableware (0.94) Dessert (0.81) retail store (0.73)
food (0.82) Ingredient (0.91) Cream (0.81)
chestnut color (0.73)
indoor (0.61) Anticuchos (0.91) Creme (0.81) deli (0.64)
  Recipe (0.88) Food (0.81) restaurant (0.55)
  Shish taouk (0.88) Meal (0.65) bakery (0.5)
  White rice (0.87)    
  Shashlik (0.87)    
  Sate kambing (0.85)    
  Pincho (0.84)    
  Satay (0.84)    
  Dish (0.83)    
  Arrosticini (0.83)    
  Rice (0.82)    
  Cuisine (0.82)    
  Souvlaki (0.81)    
  Plate (0.81)    
  Churrasco food (0.8)    
  Cooking (0.79)    
  Jasmine rice (0.79)    
  Yakitori (0.77)    
  Kebab (0.77)    
  Produce (0.75)