Error analysis is the meta-loop, not a stage. Rubrics and judges are downstream of it. Before you measure anything reliably, you read traces and write down what is wrong; that reading is what tells you what is worth measuring, and the read happens on a cadence, not once. Hamel's "look at your data, then look again" credo is the entire job description; everything else on this site is in service of it.
The chapters here cover the practice. The 60-80% rule names the prior that makes the work tractable: three failure modes usually account for most of your bugs. Open coding is the method that finds those modes. Dimensional sampling is the way you build inputs that cover the corners. Failure-mode taxonomies are the structured outputs of open coding. The trace viewer is the single highest-return tool you build to support the loop, and the NurtureBoss case study shows the loop closing in production.
Chapters: