Monthly Archives: November 2011

Typesetting Math for the Web

By Torben Nielsen | Published November 23, 2011

Typography matters. Bad typography can be as much of a barrier to the reader as bad writing. Conversely, good typography can simplify the presentation of complex content. This is especially true of mathematical formulae.

Unfortunately, there doesn’t seem to be a standard way to decently render formulae for the web — and basic HTML simply isn’t expressive enough. MathML seems to be a fair attempt at a standard, but like any XML language, it is overly verbose and hard to read. Also — not an entirely fair measure of quality, but nevertheless of practical concern — there doesn’t seem to be MathML support in WordPress, nor any easy way to enable it.

Instead, developers seem to have rallied around the idea of rendering LaTeX to an image that can then be included in the page. This at least makes the formulae viewable on practically any device, but if you care even slightly about typography, you will lie awake at night because of the alignment, spacing, typeface, and scaling issues that come from trying to make an image not look like an image.

These issues are not inherent to the rendering, but rather a consequence of the way images and text interact in HTML. Indeed, the rendering has its own alignment and spacing issues, which vary between rendering services. To my eye, the best service is the Google Visualization API, which has LaTeX rendering as an undocumented feature. That’s not to say that Google’s rendering is without issues: it has a tendency to place symbols too close together, and it offers no apparent way to influence even basic formatting.

An alternative is MathJax, which uses a combination of HTML, CSS, and web fonts to render formulae as text. In theory, this should alleviate many of the issues of the image-based approach. In practice, the typography is horrendous: most notably, nearly every letter, number, and symbol is italicized. Bad!

As a compromise, I have settled on basic HTML formatting where possible (e.g. for subscripts), which keeps inline formulae fairly coherent with their surroundings, and a LaTeX plugin for WordPress for everything else. This seems a fair trade-off between convenience and quality, but it is a compromise nonetheless.

Formulae rendered by WP-Latex

Formulae rendered by MathJax.

Formulae rendered through the Google Visualization API.

Posted in Editorial | Tagged math, typography, wordpress

The TV Show Rerun Paradox

By Torben Nielsen | Published November 22, 2011

We all know the feeling: TV stations seem to show the same episodes of our favorite shows over and over again. While the conspiracy theorist in me would love to believe this is deliberate, there’s actually a very good reason for it.

Remember when you were in school and, despite the apparent unlikelihood (after all, there are quite a few days in a year), two of your classmates had their birthdays on the very same date? Actually, this isn’t unlikely at all: in a group of 23 or more people there is a better than 50% chance that at least two birthdays coincide. For 57 or more people the chance is over 99%! This is commonly known as “the birthday paradox”, although it’s not really a paradox at all.
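If you’d rather not take those numbers on faith, they are easy to check. Here is a minimal Python sketch, assuming 365 equally likely birthdays and ignoring leap years; the function name is made up for illustration.

```python
def birthday_collision_probability(people: int, days: int = 365) -> float:
    """Chance that at least two of `people` share a birthday.

    Assumes every day of the year is equally likely (an idealization).
    """
    if people > days:
        # More people than days: a shared birthday is guaranteed.
        return 1.0
    all_distinct = 1.0
    for already_taken in range(people):
        # Each newcomer must avoid every birthday taken so far.
        all_distinct *= (days - already_taken) / days
    return 1.0 - all_distinct


for group_size in (23, 57):
    chance = birthday_collision_probability(group_size)
    print(f"{group_size} people: {chance:.3f}")
```

This prints roughly 0.507 for 23 people and 0.990 for 57.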

The same principle applies to TV show episodes, and since most series have far fewer than 365 episodes, the probabilities are actually even higher. We’ll do the calculations for a few well-known shows, but first let’s see how it works.

Let E be the set of episodes of a given show, and denote the number of episodes by |E| (read: “the size of E”). For a viewer to see no repeats, the episodes watched must be pairwise distinct. Assuming each airing is picked at random from E with equal probability, let’s calculate the probability p_E(n) that n randomly chosen episodes of E are pairwise distinct.

$p_E(n)=1\ \cdot\ \left(1-\frac{1}{|E|}\right)\ \cdot\ \left(1-\frac{2}{|E|}\right)\ \cdots\ \left(1-\frac{n-1}{|E|}\right)\ =\ \frac{|E|!}{|E|^n(|E|-n)!}$
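To see the product at work, take a hypothetical show with only |E| = 4 episodes and n = 3 viewings (numbers chosen purely for illustration):

$p_E(3)=1\ \cdot\ \left(1-\frac{1}{4}\right)\ \cdot\ \left(1-\frac{2}{4}\right)\ =\ \frac{3}{8}\ =\ \frac{4!}{4^3(4-3)!}$

So even with just three viewings of such a tiny show, the chance of at least one repeat is already 5/8.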

This calculation can be understood as the probability of choosing an unseen episode n consecutive times. The probability of seeing at least one episode more than once when watching n episodes of E is then given by 1-p_E(n). Let’s study this for a few well-known TV shows.

For Friends, which has 236 episodes, the number of episodes required for a 50% chance of a repeat is 19, and watching 46 episodes gives a 99% chance. For House, currently at 162 episodes, these numbers are 16 and 38 episodes respectively. For America’s longest-running sitcom, The Simpsons, currently at 492 episodes, watching 27 episodes gives a 50% chance of a repeat sneaking in, and watching 67 episodes brings this up to 99%!
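These figures can be reproduced with a short script. This is only a sketch, assuming every airing is an independent, uniformly random pick from the show’s episodes; the function names are hypothetical, and the episode counts are the ones quoted above.

```python
def repeat_probability(total_episodes: int, watched: int) -> float:
    """1 - p_E(n): chance of at least one repeat among `watched` random picks."""
    if watched > total_episodes:
        # More viewings than episodes: a repeat is guaranteed.
        return 1.0
    all_distinct = 1.0
    for seen in range(watched):
        # The next pick must avoid the `seen` episodes already watched.
        all_distinct *= 1.0 - seen / total_episodes
    return 1.0 - all_distinct


def episodes_needed(total_episodes: int, target: float) -> int:
    """Smallest number of viewings whose repeat probability reaches `target`."""
    watched = 1
    while repeat_probability(total_episodes, watched) < target:
        watched += 1
    return watched


shows = {"Friends": 236, "House": 162, "The Simpsons": 492}
for title, count in shows.items():
    print(title, episodes_needed(count, 0.5), episodes_needed(count, 0.99))
```

Calling episodes_needed(365, 0.5) gives 23, which matches the birthday figure from earlier.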

So the next time you tune in to your favorite syndicated TV show and are disappointed that you’ve already seen the episode, you can feel comforted that it’s not the network’s fault — it’s just math.

Extra credit: The birthday paradox can also be applied to why your iPod shuffle apparently keeps choosing the same songs to play. Although, that choice also seems to be influenced by the law that any random choice within a playlist will pick the worst song in the list.

Posted in Humor | Tagged math, tv