Find positions where chess engines significantly disagree on evaluation. As a shortcut, consider only a database of human games, and 1 ply away.
Slightly tricky is evaluation values are merely ordered and only rankings within an engine can be strictly compared.
Analyze the internals of engines? Seems mind-bogglingly difficult.
No comments :
Post a Comment