Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task