How is reinforcement learning applied in trading?