How Not to Measure AI Productivity