Multi-goal multi-copy reinforcement learning