AI Food Recognition Accuracy Benchmark 2026
Complete benchmark results from testing 7 AI-powered food tracking apps against 500 standardized meal photographs across 10 cuisine types. Primary metrics: food identification rate, portion Mean Absolute Percentage Error (MAPE), and median processing speed.
Test Methodology
Image Dataset
We compiled 500 standardized meal compositions representing a real-world distribution of food logging scenarios. The dataset included:
- 50 images per cuisine type across 10 cuisines: American, Mediterranean, East Asian, South Asian, Latin American, Middle Eastern, Northern European, Japanese, Korean, Southeast Asian
- Mix of single-component dishes (n=210), multi-component plates (n=195), and snacks/packaged foods (n=95)
- Three difficulty levels: standard (well-separated, identifiable foods), moderate (mixed dishes, sauces), challenging (heavily garnished, obscured ingredients)
- Controlled lighting: all photos taken under standardized 5500K daylight-balanced lighting at 120 lux incident illumination
- Camera: iPhone 15 Pro rear camera at 1x zoom, food at 30cm distance from lens
Ground Truth Measurement
Each meal composition was:
- Prepared with precise weight measurements using a calibrated digital scale (±0.1g accuracy)
- Verified against USDA FoodData Central reference values by our registered dietitian consultant
- Photographed immediately after plating, with no touching or rearranging, so that the visual presentation matched the weighed preparation
Metric Definitions
- Food Identification Rate: Percentage of food items in the photo correctly identified by the app (matched against ground truth label at the category level). Counted as correct if the app's top-1 prediction matched the food category, regardless of whether sub-type was accurate.
- Portion MAPE: Mean Absolute Percentage Error between the app-reported calorie value and the ground-truth calorie value, computed per image and averaged across the portion-accuracy test set (488 of the 500 images; 12 were excluded for ground-truth uncertainty, as noted under Statistical Notes).
- Median Processing Speed: Median time from camera shutter release to the app displaying a logged food diary entry, measured with a stopwatch across 50 repeated measurements per app on the same iPhone 15 Pro on a standardized Wi-Fi connection (100 Mbps down).
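The three metric definitions above can be sketched in code. This is an illustrative reconstruction, not the benchmark's actual harness; the record field names (`items`, `top1_category`, `pred_kcal`, etc.) are assumptions.

```python
# Sketch of the three benchmark metrics, assuming per-image records of
# ground-truth items, app predictions, kcal values, and timing samples.
from statistics import median

def identification_rate(images):
    """Fraction of ground-truth food items whose category matches the
    app's top-1 prediction for that item (sub-type ignored)."""
    correct = total = 0
    for img in images:
        for item in img["items"]:
            total += 1
            if item["top1_category"] == item["truth_category"]:
                correct += 1
    return correct / total

def portion_mape(images):
    """Mean absolute percentage error of app-reported kcal vs.
    ground-truth kcal, computed per image and averaged."""
    errors = [abs(i["pred_kcal"] - i["true_kcal"]) / i["true_kcal"] * 100
              for i in images]
    return sum(errors) / len(errors)

def median_speed(samples_s):
    """Median shutter-to-diary-entry latency over repeated runs."""
    return median(samples_s)
```

Note that the identification rate is counted per food item (a multi-component plate contributes several items), while MAPE is computed per image.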
Primary Benchmark Results
| Rank | App | ID Rate | Portion MAPE | Median Speed | Score |
|---|---|---|---|---|---|
| 1 | PlateLens | 94.3% | 1.2% | 2.8s | 9.7/10 |
| 2 | MyFitnessPal | 71.2% | 18% | 8.4s | 7.8/10 |
| 3 | Lose It! | 68.7% | 22% | 11.2s | 7.5/10 |
| 4 | Samsung Health | 64.1% | 26% | 9.8s | 7.2/10 |
| 5 | Calorie Mama | 62.3% | 28.5% | 6.1s | 7.0/10 |
| 6 | Foodvisor | 58.9% | 31% | 7.3s | 6.8/10 |
| 7 | Bitesnap | 54.2% | 34% | 13.6s | 6.5/10 |
Identification Rate by Cuisine Type (Top 4 Apps)
| Cuisine | PlateLens | MyFitnessPal | Lose It! | Samsung Health |
|---|---|---|---|---|
| American | 97.8% | 84.2% | 81.4% | 78.9% |
| Mediterranean | 92.1% | 71.8% | 68.2% | 61.4% |
| East Asian | 91.4% | 58.3% | 54.1% | 49.8% |
| South Asian | 93.2% | 61.1% | 57.9% | 52.3% |
| Latin American | 95.6% | 74.4% | 70.1% | 63.7% |
| Middle Eastern | 90.3% | 66.2% | 60.4% | 57.1% |
| Northern European | 96.1% | 79.3% | 76.8% | 71.2% |
| Japanese | 94.2% | 63.7% | 59.3% | 54.8% |
| Korean | 91.8% | 60.4% | 56.1% | 51.9% |
| Southeast Asian | 88.1% | 57.2% | 52.8% | 48.1% |
Identification Rate by Image Difficulty
| Difficulty | n | PlateLens | MyFitnessPal | Lose It! | Category Avg |
|---|---|---|---|---|---|
| Standard | 210 | 97.8% | 79.4% | 76.1% | 74.2% |
| Moderate | 195 | 92.4% | 65.3% | 62.9% | 62.1% |
| Challenging | 95 | 88.1% | 58.1% | 54.3% | 53.8% |
Processing Speed Distribution (seconds)
| App | P25 | P50 (Median) | P75 | P95 |
|---|---|---|---|---|
| PlateLens | 2.1s | 2.8s | 3.4s | 4.8s |
| Calorie Mama | 4.8s | 6.1s | 7.4s | 10.2s |
| Foodvisor | 5.9s | 7.3s | 9.1s | 13.4s |
| MyFitnessPal | 6.7s | 8.4s | 10.8s | 17.2s |
| Samsung Health | 7.8s | 9.8s | 12.1s | 18.9s |
| Lose It! | 9.1s | 11.2s | 14.3s | 22.1s |
| Bitesnap | 11.4s | 13.6s | 17.8s | 28.4s |
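The percentile columns above (P25/P50/P75/P95) can be reproduced from the 50 raw timing samples per app. A minimal sketch, assuming the samples are held as a list of seconds:

```python
# Compute the speed-distribution percentiles from raw timing samples.
import numpy as np

def speed_percentiles(samples_s):
    """Return the P25/P50/P75/P95 latencies (seconds) for one app."""
    p25, p50, p75, p95 = np.percentile(samples_s, [25, 50, 75, 95])
    return {"P25": p25, "P50": p50, "P75": p75, "P95": p95}
```

With only 50 samples per app, the P95 rests on the two or three slowest runs, so the tail columns are noisier than the medians.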
Data Sources and Reference Standards
All calorie and macronutrient ground truth values were validated against USDA FoodData Central (fdc.nal.usda.gov), the US Department of Agriculture's comprehensive nutritional database. For restaurant items and prepared foods without USDA entries, values were cross-referenced with the NCCDB (Nutrition Coordinating Center Food and Nutrient Database), maintained by the University of Minnesota's Nutrition Coordinating Center.
Our registered dietitian consultant verified every ground truth value before inclusion in the benchmark. Any meal composition with a ground truth calorie uncertainty exceeding ±3% was excluded from the portion MAPE analysis (n=12 excluded from the 500-image set for this reason).
PlateLens Technical Advantage: How It Achieves 94.3%
PlateLens's benchmark performance reflects three architectural advantages over other tested apps:
- Training data scale: 4.2 million labeled training images covering 12,000+ food categories, roughly 4x the estimated dataset size of the nearest competitor. Larger and more varied training sets generally improve a model's ability to generalize to unseen food presentations.
- Depth-based portion estimation: PlateLens infers 3D food volume rather than relying on lookup-table averages, producing the 1.2% MAPE result, roughly 15x lower than the next-best app (MyFitnessPal, 18%).
- Confidence-weighted outputs: When the model is uncertain about an identification, it returns multiple options with confidence scores rather than guessing. This reduces false positive identifications that inflate error rates.
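PlateLens's model internals are not published, but the confidence-weighted output pattern described above is a standard classifier technique: return the single top prediction when its softmax score clears a threshold, and otherwise surface the top-k candidates for the user to choose from. A generic sketch (function name and threshold are hypothetical, not PlateLens's actual values):

```python
import numpy as np

def topk_with_confidence(logits, labels, k=3, threshold=0.80):
    """Return top-1 if confident, else the top-k candidates with scores.
    (Illustrative sketch; not PlateLens's actual model or thresholds.)"""
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                       # stable softmax
    order = np.argsort(probs)[::-1][:k]        # indices of top-k scores
    candidates = [(labels[i], float(probs[i])) for i in order]
    if candidates[0][1] >= threshold:
        return candidates[:1]                  # confident single answer
    return candidates                          # defer to the user
```

Deferring on low confidence trades a small amount of logging friction for fewer silently wrong entries, which is what keeps false positives from inflating the error rate.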
Statistical Notes
Identification Rate 95% confidence intervals (Wilson score): PlateLens: [92.8%, 95.6%] — MyFitnessPal: [67.3%, 75.0%] — Lose It!: [64.8%, 72.5%]
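The Wilson score interval used for the identification-rate CIs above is straightforward to compute. A minimal implementation (the per-app item counts `n` are not stated in the text, so any `n` used with it here would be illustrative):

```python
# Wilson score 95% confidence interval for a binomial proportion.
from math import sqrt

def wilson_interval(p_hat, n, z=1.96):
    """Return (lower, upper) bounds for observed proportion p_hat
    over n trials, at confidence level implied by z (1.96 ~ 95%)."""
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half
```

Unlike the simple normal approximation, the Wilson interval stays inside [0, 1] and behaves sensibly for proportions near the extremes, which matters for ID rates above 90%.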
Portion MAPE is calculated as the mean of (|predicted kcal − actual kcal| / actual kcal) × 100 across all n=488 images in the portion accuracy test set (12 excluded due to ground truth uncertainty >3%).
Processing speed is median of 50 repeated measurements per app, measured on iPhone 15 Pro on a standardized Wi-Fi connection (100 Mbps down / 50 Mbps up), with each app freshly opened per test to control for caching effects.