statistics « Ben Shoemate's Notebook

RSS

Posts Tagged ‘statistics’

The Mathematics of Changing Your Mind

06 Aug

The controversial history of the mathematical theorem that tells us when we should change our minds.

No Comments

Posted in Google Reader

Drupal contributor statistics

06 Jun

I recently extracted some data from the Drupal project's CVS and Git logs to see how the number of code contributors and total contributions have changed over time. If there was any doubt of our continual growth, the resulting charts demolish it.

Aggregated results from core and contributed modules.

Aggregated results from core and contributed modules.

As can be seen from the graphs, there is a pretty big spike in commit activity post-Git migration.

No Comments

Posted in Google Reader

jStat â€“ A Statistical Library With JavaScript

10 Apr

jStat is a JavaScript library which enables you to perform advanced statistical operations without the need of a dedicated statistical language.

Simply, it focuses on being a real JavaScript-based alternative for languages like R and MATLAB.

The library is standalone, however, for the plotting functionality, it requires jQuery, jQuery UI and jQuery-flot plugin.

Special Downloads:
Ajaxed Add-To-Basket Scenarios With jQuery And PHP
Free Admin Template For Web Applications
jQuery Dynamic Dragâ€™n Drop
ScheduledTweets

Advertisements:
Professional XHTML Admin Template ($15 Discount With The Code: WRD.)
Psd to Xhtml
SSLmatic â€“ Cheap SSL Certificates (from $19.99/year)

No Comments

Posted in Google Reader

The Unprecedented Rise of Apple iOS and Other Internet Trends [STATS]

16 Nov

Legendary Internet analyst Mary Meeker has some statistics she thinks every Internet executive should know, including that iOS is growing faster than almost any other Internet technology in history.

At the Web 2.0 Summit in San Francisco, the Morgan Stanley analyst led a rapid-pace presentation on the state of the Internet industry, revealing the state of mobile (Apple and Google are winning), the most under-monetized asset in online advertising (Facebook) and even the secret sauce of Steve Jobs (he has the mind of an engineer and the heart of an artist).

Some of Meekerâ€™s eye-popping stats:

46% of Internet users live in five countries: the USA, Russia, Brazil, China and India.
There are 670 million 3G subscribers worldwide, 136.6 million in the U.S. and 106.3 million in Japan.
iOS devices reached 120 million subscribers in 13 quarters, far faster than Netscape, AOL or NTT docomoâ€™s growth rates.
Nokia and Symbian used to own 62% of the smartphone market (units shipped). Now itâ€™s only 37%, mostly due to Android and iOS.
The average CPM for social networking sites is at only $0.55. Meeker thinks this will increase and normalize in the next few years. She also believes that inventory on Facebook is one of the most under-monetized assets on the web.
It took e-commerce 15 years to get to 5% of retail. Morgan Stanley predicts mobile should get to that same level in five years.
Streaming video is up to 37% of of Internet traffic during traditional â€œTV hours.â€ Netflix is the biggest contributor to this, followed by YouTube.
Seven of the companies that were in the top 15 publicly traded Internet companies in 2004 are not in that list in 2010.
Interest payments and entitlement spending is projected to exceed government revenue by 2025. In other words, the U.S. government is facing a real financial crisis soon.

Weâ€™ve included Mary Meekerâ€™s full presentation below. Let us know what you think of her statistics and trends in the comments.

Reviews: Android, Facebook, Google, Internet, YouTube

More About: apple, internet, Internet trends, iOS, Mary Meeker, Morgan Stanley, statistics, stats, steve jobs, W2S2010, Web 2.0 Summit

For more Business coverage:

Follow Mashable Business on Twitter
Become a Fan on Facebook
Subscribe to the Business channel
Download our free apps for iPhone and iPad

No Comments

Posted in Google Reader

Twitter API Calls Doubled Since April: Now Serving 70,000 Every Second

16 Sep

Have you ever wondered how much traffic Twitter handles in a given day, or what software sits behind the curtain of the popular service?Â A recent presentation reveals some of the answers. Twitterâ€™s incredible growth becomes obvious when you compare the recent numbers to those announced at Chirp.

On September 9th, Twitterâ€™s university recruiting team stopped by UC Berkeley to talk about the company and what it does. The slides from platform engineer Raffi Krikorianâ€™s talk, Twitter by the Numbers, are now online, and they disclose some fascinating technical details about the social media giantâ€™s operations.

Twitter serves over 70 million tweets per day, totaling over 12GB of tweet text alone.Â Many of those messages are delivered to client apps and web sites through the Twitter API to the tune of six billion API calls per day (double what was announced at Chirp in April), or about 70,000 API calls per second.Â All told, the service generates 8TB of data every day, which is eight times more than the New York Stock Exchange.

Pop quiz, engineers: Your web service needs to deliver real-time message traffic to an asymmetric digraph of over 150 million users.Â What database do you use?Â WHAT DATABASE DO YOU USE?

Shoot the hostage
Oracle
MySQL
Write your own database

If youâ€™re Twitter, the correct answer is #4:Â Create your own database software, call it FlockDB, and release it on github.Â (By the way, if you picked option 1, maybe software engineering isnâ€™t the right career choice for you.)

FlockDB is just one of the home-grown, high-performance software systems Twitter uses to support its tremendous growth.Â Others include:

hosebird, a â€œnear real-timeâ€ streaming API back-end (instead of REST, which is only â€œpseudo real-timeâ€); and
snowflake (also on github), a network service to generate unique IDs at high scale (MySQL couldnâ€™t keep up, and was a single point of failure).

With a stated goal of supporting â€œhalf the world and all its devices,â€ Twitter faces many engineering challenges.Â This peek under the hood (full slides are embedded below) shows that theyâ€™re aware of the potential problems, and are working hard to steer clear of the fail whale.

Via Delyn Simons