Paper accepted at COLM 2025: Have Large Language Models Learned to Reason? A Characterization via 3-SAT