LIFEBENCH: A comprehensive benchmark for evaluating and improving length instruction following in large language models.