WSJ Graphs Showing Increases in Students Studying Statistics

A recent WSJ special on Big Data had a section call Help Wanted!  The first paragraph reads:

“Corporate executives face a daunting obstacle when it comes to reaping the benefits of big data: Who’s going to tell them what it all means?”

This is a real problem.  As firms collect more and more data, they will need people who can productively make sense of it.

Fortunately, more people are getting trained in statistics (and analytics) to meet this demand.  The WSJ ran an article called “Data Crunchers Now the Cool Kids on Campus” (the statistics are from that article.)

And, IBM, Ohio State, and others announced plans for an advanced analytics center in Columbus, OH to help train, do research, and help firms with analytics.

There is a lot of opportunity for people who can work with and make sense of data.

Karl Kempf Profile in Businessweek

One of my favorite columns in Businessweek is the profile of someone doing something interesting in technology.

I was very pleasantly surprised to our field of optimization pop up last May (I meant to post earlier, but was only reminded because I am using this example in my optimization class tomorrow).  Karl Kempf of Intel was profiled because of his ability to use mathematical optimization to improve Intel (and make race cars go faster and movie stunts more realistic).

BusinessWeek recently ran an article on Coca Cola’s new orange juice plant.  Although this article doesn’t give the details of their secret production formula (which they call Black Book), it looks like a nice application of the classic linear programming blending problem.  It is also interesting to see all the other data collection and analytics that go into the process.

Here is a key passage from the article:

Black Book isn’t really a secret formula. It’s an algorithm. Revenue Analytics consultant Bob Cross, architect of Coke’s juice model, also built the model Delta Air Lines (DAL) uses to maximize its revenue per mile flown. Orange juice, says Cross, “is definitely one of the most complex applications of business analytics. It requires analyzing up to 1 quintillion decision variables to consistently deliver the optimal blend, despite the whims of Mother Nature.”

The Black Book model includes detailed data about the myriad flavors—more than 600 in all—that make up an orange, and consumer preferences. Those data are matched to a profile detailing acidity, sweetness, and other attributes of each batch of raw juice. The algorithm then tells Coke how to blend batches to replicate a certain taste and consistency, right down to pulp content. Another part of Black Book incorporates external factors such as weather patterns, expected crop yields, and cost pressures. This helps Coke plan so that supplies will be on hand as far ahead as 15 months. “If we have a hurricane or a freeze,” Bippert says, “we can quickly replan the business in 5 or 10 minutes just because we’ve mathematically modeled it.”


Economist Feb 9 2013 Print Edition

The Economist recently published an interesting article on how advances in security cameras are allowing stores to better track and respond to consumers.

It seems like it wasn’t too long ago that an average 3-year-old could easily beat a computer at determining someone’s gender by looking at a picture.  Now, these cameras are getting better at determining gender as well as age of the shoppers in the store.

This advance really allows in-store retailers to catch-up with their on-line counterparts in terms of understanding customers:  where do they spend their time, how long do they stay, what do they look at, and what do they ultimately buy?

The article provides an example of a retailer who determined when the number of shoppers peaked (it wasn’t when sales peaked) and built their staff schedules around that to generate more sales.

There is a lot more retailers can do with this information.  It will be interesting to see how this evolves over the next few years.

DC Water suppliers water to 2 million customers in the Washington DC area with several thousand miles of pipes (with an average age of over 75 years) and maintains nearly 9000 fire hydrants.  Several years ago, everything was tracked on paper.

IBM and DC Water have recently published several videos and articles on how DC Water has applied descriptive, predictive, and prescriptive analytics to the system with positive results.

This is a nice case study in the use of analytics.

The descriptive analytics allowed DC Water to map the location of the fire hydrants for better maintenance.  With extra sensors it also allowed them to better monitor water useage and look for anomalies

With aging pipes, the predictive analytics allowed them to predict failures before they happened.

And, the prescriptive analytics allowed DC Water to better route maintenance crews to fix trouble tickets increasing the productivity of the maintenance team while driving down fuel cost.

IBM published several videos on the solution>

This first video is a nice high-level overview of the solution:

The second video provides more details of the solution:

The third video highlights the predictive analytics solution:


IBM also published a written document of the solution, but I found the videos very well done.