Consider all 1000-element subsets of the set {1,2,3,...,2015}. From each such subset choose the least element. The arithmetic mean of all of these least elements is qp, where p and q are relatively prime positive integers. Find p+q.
Hint
Use the Hockey Stick Identity in the form
(aa)+(aa+1)+(aa+2)+⋯+(ab)=(a+1b+1).
(This is best proven by a combinatorial argument that coincidentally pertains to the problem: count two ways the number of subsets of the first (b+1) numbers with (a+1) elements whose least element is i, for 1≤i≤b−a+1.)
解析
Solution 1
Let M be the desired mean. Then because (10002015) subsets have 1000 elements and (9992015−i) have i as their least element,
(10002015)M=1⋅(9992014)+2⋅(9992013)+⋯+1016⋅(999999)=(9992014)+(9992013)+⋯+(999999)+(9992013)+(9992012)+⋯+(999999)…+(999999)=(10002015)+(10002014)+⋯+(10001000)=(10012016).
Using the definition of binomial coefficient and the identity n!=n⋅(n−1)!, we deduce that
M=10012016=143288.
The answer is 431.
Potential Inspiration
For solution 1, the inspiration could be that once we select the set {a1,a2,...,a2015}, where a1,wewanttomultiplyeachsuchsetbya_1$ and sum up through all such sets.
So, how do we scale each such set by a1? We realize that if we were to choose one more element, specifically, less than a1, this could be done in a1−1 ways.
Oh! So, now if we add 1 to all the numbers in our set, they become {a1+1,a2+1,...,a2015+1}, so the number of ways to pick another number below the least number in this set is a1!
Solution 2
Each 1000-element subset {a1,a2,a3,...,a1000} of {1,2,3,...,2015} with a1contributesa_1tothesumoftheleastelementofeachsubset.Now,considertheset\left\{a_1+1,a_2+1,a_3+1,...,a_{1000}+1\right\}.Therearea_1waystochooseapositiveintegerksuchthatk (k can be anything from 1 to a1 inclusive). Thus, the number of ways to choose the set {k,a1+1,a2+1,a3+1,...,a1000+1} is equal to the sum. But choosing a set {k,a1+1,a2+1,a3+1,...,a1000+1} is the same as choosing a 1001-element subset from {1,2,3,...,2016}!
Thus, the average is (10002015)(10012016)=10012016=143288. Our answer is p+q=288+143=431.
Solution 3(NOT RIGOROUS)
Let p be the size of the large set and q be the size of the subset (i.e. in this problem, p=2015 and q=1000). We can easily find the answers for smaller values of p and q:
For p=2 and q=2, the answer is 1.
For p=3 and q=2, the answer is 34.
For p=4 and q=2, the answer is 35.
For p=3 and q=3, the answer is 1.
For p=4 and q=3, the answer is 45.
For p=5 and q=3, the answer is 23.
At this point, we can see a pattern: our desired answer is always q+1p+1. Plugging in p=2015 and q=1000, the answer is 10012016=143288, so 288+143=431.
Solution 4 (short)
In the "average case", the numbers evenly partition the interval [0,2016] into 1001 parts. Then because it asks for the expected value of the least element the answer is 10012016.
-tigershark22
VIEWER NOTE: This solution doesn't always work, for example, take n2 on [0,1]. The "average case" is n=21⟹n2=41 but integrating n2 from 0 to 1 we see that definitely is not the case. This works only in certain situations, so maybe this solution would be better off with proof that this is one of those situations.
Solution 5
When it says average, it really just means expected value. Let's do casework.
Our first case is when the smallest number in the subset is 1. There are 2014 numbers to choose from, and 999 numbers left to choose. So, this contributes (9992014) to our expected value.
Then, for the smallest number being 2, we have 2013 numbers to choose from and 999 numbers left to choose. So, this contributes 2(9992013).
Similarly, the smallest number being 3 contributes 3(9992012) to our expected value. We keep this up all the way to 1016(999999).
This makes our sum
(9992014)+2(9992013)+3(9992012)+⋯+1016(999999).
To evaluate this, we can split each binomial into groups of 1, like this:
(9992014)+(9992013)+(9992013)+(9992012)+(9992012)+(9992012)+⋯+1016 (999999)’s(999999)+(999999)+⋯+(999999).
Now, we can turn this into:
((9992014)+(9992013)+⋯+(999999))+((9992013)+(9992012)+⋯+(999999))+⋯+((9991000)+(999999))+(999999).
Now, by the Hockey Stick Identity, we can further simplify this into
(10002015)+(10002014)+⋯+(10001001)+(10001000).
Further simplifying this, we get
(10012016).
If this is our sum and there are (10002015) possible subsets, our average is simply