Our month long experiment with allowing /r/askreddit here again has ended, please provide your feedback regarding it in this thread.
User claims that yoga cures cancer citing research papers. Another user actually goes through the papers and shows how that's not the case
Redditor experiences mishap in bus and blames the driver. Another redditor who happened to be in the very same bus gives a better explanation as to what happened.
Redditor explains why the US recruited Nazi scientists, and lists some of the more notable contributors to the US space program
/u/LarBrd33 explains the secret origin of Fergie and the Black Eyed Peas
/u/signsandwonders calculates the average of all the presidents birthdays - the answer is July 4th.
Redditor misses the mark on a GE stock prediction, eats his sock as promised, posts video proof
Redditor explains what Australia did to prevent gun related deaths.
Florida teacher describes a fire drill at their school today where everyone refused to leave their rooms. Other teachers comment about how scared their students are across the country.
/u/the_straw-man compares current day USA with "how the Roman Republic turned into the Roman Empire". A brief conversation on Roman history ensues.
Redditor knows what Elon Musk is really up to
Reddator describes how a dislocated shoulder kept him out of the Boston bombing marathon
/u/datraceman gives a very accurate description of what a megachurch in the city of Birmingham, AL is really like.
Programmer lists his plan to churn out dozens of slight variations of the same app
Hotel Front Desk Staff Eats Dinner With Man Whose Wife is in the Neighboring Hospital Every Evening for 2 Months.
American asks UK citizens the etiquette of paying in UK strip clubs, almost every answer comes with expected British humour.
/u/SkiThe802 explains how sports timing works in racing events and thus why there was a tie for gold in bobsled
70 year old grandma gives advice about life
Redditor breaks down in incredible detail how to be better at competitive pokemon
/u/NoSuch explains why college students might actually be overcooking and burning popcorn on purpose
True Englander reminds people what being English is all about.
/u/whitexknight has a thought provoking plan for eternity as an immortal
Succinct counter-argument to the defense against tyranny intent of the 2nd amendment
/u/918AmazingAsian breaks down what went wrong with BvS
In the wake of news that Barnes and Noble is circling the drain, commentator lists a DOZEN ways the company has failed to innovate over the past decade
User /u/loser4455 explains how the symbiotic relationship with bacteria in our mouth works and a hilarious thread of replies ensues
TIFU thread turns into a demonstration of impressive vocabulary
Redditor gives worst case scenario to an underage party
Redditor describes the most beautifully suicidal animal on earth.
Redditor briefly explains how stocks in a company works.
Video of children in Africa seeing/hearing a fiddle for the first time. Real life fiddler shows up in Reddit.
A man explains how to effectively catch pigeons in the city
Redditor thought Alexa called 911 when he asked it to play Garth Brooks, but it was actually part of the song
Comment chain of puns
American Redditor offers $1000 to aid in return of lost/stolen Irish dog
Redditor writes a poem about a bench
Political Parody of Old Spice Commercial.
Redditor details how effective the Russian propaganda campaign was and how it prompted them to change their vote in 2016
u/RunDNA shows a list of 241 posts from T_D that have been linked to one of the fake Russian twitter accounts from yesterday’s indictment. Top comment by u/f_k_a_g_n actually shows over 1,000 posts from T_D linked to the same twitter account.
User gives every reply. Yes, all of them.
Redditor debunks very common misconception about "heritability" in genes
220 0 DrLionelRaymond 10/10. Would hire. (I run the R&D wing of a data science company).
29 0 bokononpreist Did you find a difference between home and away games?
76 0 SpecialK_714 Ol' Roy already knew this tho
17 0 zamstat I hope it is okay if I play the role of skeptic. It's not that I don't think you did good work - this is a fantastic write-up - but there are a few things that make me hesitant. * Was your final sample size ~450 with only ~30 runs without timeouts? Given that there are over 5000 games in a given season, that seems small to me. Less than 10% of all games contain a run that might merit a timeout? If it is the right sample size, I am also surprised that a timeout was called in 95% (~30/450) of scenarios where a team went on a run of 6+ points in a short timeframe. I cannot tell from your code whether you limited the analysis to only those games where the final overall scoring margin during the game was greater than 5 points. You mentioned it in your documentation, but I could not verify whether it was implemented. I agree with your inclination to attempt to control for 'guarantee game' blowout where teams are likely to go on 6+ runs, the opponent is less likely to calla timeout, and the run is likely to continue. However, I don't think the solution is to restrict the dataset to only those games that ended up close. You lose some relevant timeout scenarios (e.g., team that is up large/has opponent go on 6pt run/calls timeout/proceeds to blow them out) as well as potentially bias the control scenarios. * I must admit that I am not 100% clear on your methodology. I think I understand how you assess the stoppage of runs for settings with a timeout: you start the time-window at the timeout and track performance over the next 10 possessions. I am less clear on how you handle non-timeouts. Do you start the clock once the run becomes official (i.e., 6+ points) or when it hits its peak? If either of these are true, I believe you might be giving an advantage to the non-timeout group. As a quick check, it would be interesting to look at the summary statistics regarding the run size between the timeout and non-timeout groups. * Have you considered a matching study design? That is, for every scenario where a timeout was called, find a similar scenario where a timeout was not called. You could match on the current scoring differential, the time left in the game, and even on how the run progressed (e.g., made 2, miss, made 3, steal, made 3). * Have you considered working with expected win probabilities? Given that you scrapped play-by-plays for every game, you could likely create empirical win probability tables for every second of a 40 minute game. I see two key advantages here. I personally think this better quantifies a run (is a game 'slipping' away?) compared to a simple point differential. I am skeptical of 6+ points being classified as a run as a blanket statement. Also, it may help eliminate the 'blowout' scenarios. Again, I think this was a great investigation. I especially want to thank you for sharing your code. *Edits for formatting*
11 0 Barnhard This is impressive.
8 0 hesnothere So I guess what you're saying is, Roy knew?
6 0 [deleted] Good work. Someone should pay you for this.
6 0 [deleted] [deleted]
3 0 ljfdlksdjfhkl A few comments. A couple of preliminaries, first on notation: p(x|y) is not the probability that x AND y occur. It is the probability that x occurs GIVEN that y has already occurred. Bayes rule is all about this: p(x and y) [often written p(x ∩ y)] = p(x|y)*p(y) = p(y|x)*p(x). Secondly, keep in mind that this is an observational study, and so inferring causality is tricky. In particular, p(run ended|timeout) is NOT the "probability that calling a timeout is responsible for ending a run". It is the probability that the run ended given that a timeout was called, and says nothing directly about causal links. I think your calculations of p(RE|T) are correct, but two things: (a) the denominator in your Bayes equation is actually just p(timeout called), which you could calculate directly and simplify your code. But also (b), do you need to do the Bayes bit at all? Can't you calculate p(RE|T) directly from the data, as you are doing now for p(T|RE)? [i.e. find all situations where a timeout was called, and tabulate the proportion of those where the run ended = p(RE|T)]. Either way, the absolute value of p(RE|T) is not particularly informative (so your statement "as low as a 22% chance that timeouts are responsible for ending runs" is a little misleading). Imagine a situation where the probability of a run ending (without timeout) is 0.1, but it's 0.2 when a timeout is called. That would seem to be pretty good evidence that calling a timeout is associated with an increased probability of the run ending, even though the probability of the run ending is still quite low in absolute terms. Which brings us back to causality. Ideally you want to find situations where timeouts were called, and comparable situations where timeouts were not called, and look for evidence of a difference in the two sets of outcomes. I realize that you already know this, I'm just writing it down. Having found that evidence, it's down to interpretation and judgement as to whether or not the timeout was the causal mechanism. So the more relevant interpretation of p(RE|T) would be to look at p(RE|T)-p(RE|not T). A positive value would indiate a potential positive effect of timeout. This is what you're doing in the last figure. But your differences are negative (p(RE|not T) > p(RE|T) ?), with timeouts being associated with a slightly lower probability of the run ending. Which might be genuine (perhaps timeouts tend to be called when coaches get desperate and the run is basically unsalvageable, so the fact that the timeout has been called is really an indicator of the calling team being outplayed, rather than any indication of the timeout itself having a negative effect on run ending). But also I wonder if the way you are viewing runs is slightly problematic - runs of up to 10 events seems quite long, and the chances of a timeout being called in that span are quite high (e.g. look at your score ratio=1.0 numbers: you have 456 runs, in 430 of which a timeout was called). So (a) you have very little data about what happens when timeouts are NOT called, which makes it difficult to make inferences about the effect of timeout. And (b) there is no distinction between a timeout called early in the run vs a timeout called late in the run. Maybe there is some value in shortening your 10-event window, and/or comparing runs of given length (e.g. find all runs that extended for at least 3 events. Calculate the proportion of runs that ended after timeout was called AT 3 events, and the proportion that ended at 3 events without timeout. Repeat for different run lengths). Final comment, it might be worth repeating this in a statistical modelling framework and comparing results. It would be fairly straightforward: fit a binomial generalized linear model where the outcome (run ended) is a function of whether timeout was called (true or false), and look at the coefficient of the timeout term (and test its significance, if you like). Sorry all of that sounds rather negative, it's not meant to be. There's definitely some interesting results in there, just needs some refining to bring them out. You might be interested in this study, which looks at a closely-analogous situation of timeouts in volleyball matches: [detail]( and [shorter summary]( Basically - there's not a lot of evidence that timeouts help in volleyball, either. (Edit: formatting)
3 0 barrio-libre Please forward to Sean Miller.
3 0 purpletraitor69 how would I know?
2 0 TotesMessenger I'm a bot, *bleep*, *bloop*. Someone has linked to this thread from another place on reddit: - [/r/bestof] [\/u\/Chu\_BOT posts rigorous statistical analysis of whether timeouts affect scoring runs in basketball. Potentially gets a job offer on the spot.]( - [/r/theydidthemath] [\[RDTM\] Do timeouts stop scoring runs? (a thrilling statistical analysis) (x-post from r\/collegebasketball)]( [](#footer)*^(If you follow any of the above links, please respect the rules of reddit and don't vote in the other threads.) ^\([Info](/r/TotesMessenger) ^/ ^[Contact](/message/compose?to=/r/TotesMessenger))* [](#bot)