In DiD, standard errors must be clustered at the unit level that received the treatment.
Bertrand, Duflo & Mullainathan (2004) showed that DiD studies that cluster at too fine a level (e.g., individual) severely overreject the null. The reason: serial correlation within units over time inflates the effective number of independent observations.
Rule of thumb: cluster at the level at which treatment varies. If a state-level policy changed, cluster by state. If a firm-level policy, cluster by firm.
Few clusters problem. With fewer than ~30 clusters, standard cluster-robust SEs can be undersized. Use the wild cluster bootstrap (Cameron, Gelbach & Miller 2008) for inference with few clusters.
With staggered treatment, cluster at the unit level (e.g., state) and consider two-way clustering (state × time) if there is reason to believe correlated shocks across units within periods.