What if you must take exactly M piles (not up to 2M)? The state and transition change slightly. What if M doesn't update? Then M is constant, and we have a simpler interval DP. What if piles can be taken from either end? Combine with Predict the Winner logic.
Each variant changes the transition but keeps the core idea: track what the current player can guarantee. Each variant teaches a different aspect of game theory DP. Practice all three to build intuition.