Angel \”Java\” Lopez on Blog

July 8, 2015

Big Data: Links, News And Resources (7)

Filed under: Big Data, Links — ajlopez @ 10:37 am

Previous Post

SAMOA: A Platform for Mining Big Data Streams
http://www2013.org/companion/p777.pdf

DevOps Round-Up: Hadoop and Big Data Analytics Get a Boost From Splunk | DevOpsANGLE
http://devopsangle.com/2013/05/27/devops-round-up-hadoop-and-big-data-analytics-get-a-boost-from-splunk/

What The Hell is… Big Data? | LinkedIn
http://www.linkedin.com/today/post/article/20130527063838-64875646-what-the-hell-is-big-data

Cloud, Big Data and Mobile: Understanding Amazon Elastic Load Balancing in Detail
http://harish11g.blogspot.in/2013/05/Understanding-Amazon-Elastic-Load-Balancing-ELB-in-detail-with-usecases.html

El Big Data y la dilución de la política by Alejandro Piscitelli on Prezi
http://prezi.com/wltuvoyiapgi/el-big-data-y-la-dilucion-de-la-politica/

What the ‘Internet of things’ really means | Consumerization Of It – InfoWorld
http://www.infoworld.com/d/consumerization-of-it/what-the-internet-of-things-really-means-217657

Big Data—for better or worse
http://phys.org/news/2013-05-big-datafor-worse.html

Six disruptive possibilities from big data – Strata
http://strata.oreilly.com/2013/05/six-disruptive-possibilities-from-big-data.html

Spark | Lightning-Fast Cluster Computing
http://spark-project.org/

Learning Spark – O’Reilly Media
http://shop.oreilly.com/product/0636920028512.do

Reactor – a foundation for asynchronous applications on the JVM | SpringSource Team Blog
http://blog.springsource.org/2013/05/13/reactor-a-foundation-for-asynchronous-applications-on-the-jvm/

How NoSQL, MySQL and MongoDB worked together to solve a big-data problem
http://www.theserverside.com/feature/How-NoSQL-MySQL-and-MogoDB-worked-together-to-solve-a-big-data-problem

Big Data – Hadoop – BIDOOP | PRAGSIS Big Data Hadoop
http://bigdata-hadoop.pragsis.com/pages/2/big_data_hadoop_bidoop?language=es

Introduction to HCatalog, Pig scripts and heavy burdens | Alejandro Jezierski
http://blogs.southworks.net/ajezierski/2013/04/09/introduction-to-hcatalog-pig-scripts-and-heavy-burdens/

Developing Big Data Solutions on Windows Azure, the blind and the elephant | Alejandro Jezierski
http://blogs.southworks.net/ajezierski/2013/03/15/developing-big-data-solutions-on-windows-azure-the-blind-and-the-elephant/

My Links
http://delicious.com/ajlopez/bigdata

Stay tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

July 2, 2015

Big Data: Links, News And Resources (6)

Filed under: Big Data, Links — ajlopez @ 10:47 am

Previous Post
Next Post

Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
http://www.cs.berkeley.edu/~matei/papers/2011/tr_spark.pdf

¿Que pasa con BigData en Argentina? | IT on business!
http://itonbusiness.wordpress.com/2013/03/04/que-pasa-con-bigdata-en-argentina/

Big Data Developers in Buenos Aires (Buenos Aires) – Meetup
http://www.meetup.com/Big-Data-Developers-in-Buenos-Aires/?gj=ej1b&a=wg2.1_grpn

A programmer’s guide to big data: 12 tools to know — Tech News and Analysis
http://gigaom.com/2012/12/18/a-programmers-guide-to-big-data-12-tools-to-know/

Running the Largest Hadoop DFS Cluster
http://www.infoq.com/presentations/Hadoop-HDFS-Facebook

Thoughts on AWS Redshift… | Database Fog Blog
http://robklopp.wordpress.com/2013/03/11/thoughts-on-aws-redshift/

Amazon preparing ‘disruptive’ big data AWS service? • The Register
http://www.theregister.co.uk/2013/02/19/amazon_new_big_data_aws_service/

Incremental computing – Wikipedia, the free encyclopedia
http://en.wikipedia.org/wiki/Incremental_computing

Realtime vs Long Term Data Analysis with Storm/Hadoop/Cassandra – storm-user | Google Groups
https://groups.google.com/group/storm-user/browse_thread/thread/4426ad919c1eb3bd

The history of Hadoop: From 4 nodes to the future of data — Tech News and Analysis
http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4-nodes-to-the-future-of-data/

Understanding the Parallelism of a Storm Topology – Michael G. Noll
http://www.michael-noll.com/blog/2012/10/16/understanding-the-parallelism-of-a-storm-topology/

Big Data Lets You Profile and Recruit the Best Employees | SmartData Collective
http://smartdatacollective.com/kathryn1723/108026/big-data-lets-you-profile-and-recruit-best-employees

Push Technology
http://www.pushtechnology.com/

BigData Spain – Home
http://www.bigdataspain.org/en/

Big Jobs
http://www.bigdataspain.org/jobs/en/

‘Big data’ is dead. What’s next? | VentureBeat
http://venturebeat.com/2013/02/22/big-data-is-dead-whats-next/

How to Build Big Data Pipelines for Hadoop Using OSS
http://www.infoq.com/presentations/Big-Data-Pipelines-Spring

How Netflix is turning viewers into puppets – Salon.com
http://www.salon.com/2013/02/01/how_netflix_is_turning_viewers_into_puppets/

My Links
http://delicious.com/ajlopez/bigdata

Stay tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

December 25, 2013

Big Data: Links, News And Resources (5)

Filed under: Big Data, Links — ajlopez @ 5:12 pm

Previous Post
Next Post

The Big Bang: How the Big Data Explosion Is Changing the World
http://www.microsoft.com/en-us/news/features/2013/feb13/02-11BigData.aspx

Customers Rapidly Adopting Big Data Solutions — Driven By Marketing, Sales and More — Reports New Microsoft Research
http://www.microsoft.com/en-us/news/Press/2013/Feb13/02-11BigDataRoundupPR.aspx

Structure:Data | GigaOM Events
http://event.gigaom.com/structuredata/

CERN Data Centre passes 100 petabytes | CERN
http://home.web.cern.ch/about/updates/2013/02/cern-data-centre-passes-100-petabytes

DARPA puts $3M into startup pushing big data in Python — Tech News and Analysis
http://gigaom.com/2013/02/05/darpa-puts-3m-into-startup-pushing-big-data-in-python/

Click Dataset | Center for Complex Networks and Systems Research
http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset

Python for Data Analysis: Wes McKinney: 9781449319793: Amazon.com: Books
http://www.amazon.com/Python-Data-Analysis-Wes-McKinney/dp/1449319793

Iteratees in Big Data at Klout « Klout Engineering
http://engineering.klout.com/2013/01/iteratees-in-big-data-at-klout/

Big Data is over the hype – can we get on with real work now? | Capping IT Off | Capgemini
http://www.capgemini.com/technology-blog/2013/01/big-data-hype-real-work/

Event Driven Architecture | Inside Analysis
http://www.insideanalysis.com/research/event-driven-architecture/

Disk-Locality in Datacenter Computing Considered Irrelevant (and then what?)
http://www.cs.berkeley.edu/~ganesha/talks/disk-irrelevant.pdf

Technical Discovery: Passing the torch of NumPy and moving on to Blaze
http://technicaldiscovery.blogspot.ca/2012/12/passing-torch-of-numpy-and-moving-on-to.html

A Python Compiler for Big Data
http://continuum.io/blog/blaze

GigaSpaces | High Scalability with GigaSpaces XAP & Cloudify – Your Open PaaS Stack for Business Apps
http://www.gigaspaces.com/

Intuit CEO: Big Data Can Be “The Great Equalizer”
http://readwrite.com/2012/12/14/intuit-big-data-doesnt-have-to-crush-consumers-small-businesses

Precog
http://www.precog.com/
Precog is a powerful analytics platform for JSON data. Stream, upload, or synchronize data into Precog, and perform advanced analytics usingLabcoator our simpleREST APIs.

Amazon Redshift
http://aws.amazon.com/redshift/
Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence tools. It is optimized for datasets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Expanding the Cloud – Announcing Amazon Redshift, a Petabyte-scale Data Warehouse Service – All Things Distributed
http://www.allthingsdistributed.com/2012/11/amazon-redshift.html

High Scalability – BigData using Erlang, C and Lisp to Fight the Tsunami of Mobile Data
http://highscalability.com/blog/2012/11/26/bigdata-using-erlang-c-and-lisp-to-fight-the-tsunami-of-mobi.html

Keynote: Spring 2012 and Beyond
http://www.infoq.com/presentations/SpringOne-2GX-2012-Keynote-1
Adrian Colyer, Juergen Hoeller, Mark Pollack and Graeme Rocher present SpringSource’s Unifying Component Model, current developments regarding Big Data, and betting on Grails.

My Links
http://delicious.com/ajlopez/bigdata

Keep tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

December 23, 2013

Big Data: Links, News And Resources (4)

Filed under: Big Data, Links — ajlopez @ 5:15 pm

Previous Post
Next Post

HBase Architecture 101 – Storage
http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html

Culturomics 2.0: Forecasting large-scale human behavior using global news media tone in time and space
http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/3663/3040

Zillabyte
http://zillabyte.com/
Smart Sales Prospecting

SQLFire: Scalable SQL instead of NoSQL
http://www.infoq.com/presentations/SQLFire-Scalable-SQL-instead-of-NoSQL

Karmasphere 2.0
https://karmasphere.com/what-is-karmasphere
Collaborative Analytics Workspace on Hadoop with Self-Service for Everyone in the Business

MapR
http://www.mapr.com/
MapR delivers on the promise of Hadoop, making managing and analyzing Big Data a reality for more business users.

Big Data Now: Current Perspectives from O’Reilly Radar
http://shop.oreilly.com/product/0636920022640.do

Big Data and Human Judgment
http://www.cquotient.com/big-data-and-human-judgment/

The Petabyte Age: Because More Isn’t Just More — More Is Different
http://www.wired.com/science/discoveries/magazine/16-07/pb_intro

Strata New York Speaker Slides & Video
http://strataconf.com/stratany2011/public/schedule/proceedings

Six Provocations for Big Data
http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1926431

Apache Giraph
http://incubator.apache.org/giraph/
For general-purpose big data computation, the map-reduce computing model has been well adopted and the most deployed map-reduce infrastructure is Apache Hadoop. We have implemented a graph-processing framework that is launched as a typical Hadoop job to leverage existing Hadoop infrastructure, such as Amazon’s EC2. Giraph builds upon the graph-oriented nature of Pregel but additionally adds fault-tolerance to the coordinator process with the use of ZooKeeper as its centralized coordination service.

Video: 3 Big Data Tech Talks You Can’t Miss
http://engineering.linkedin.com/event/video-3-big-data-tech-talks-you-can%E2%80%99t-miss

Big Data Needs an End in Sight
http://www.dataroundtable.com/

The “Big Five” IT trends of the next half decade: Mobile, social, cloud, consumerization, and big data
http://www.zdnet.com/blog/hinchcliffe/the-big-five-it-trends-of-the-next-half-decade-mobile-social-cloud-consumerization-and-big-data/1811

Hadoop Virtual Panel
http://www.infoq.com/articles/HadoopVirtualPanel

BigData Spain 2012
http://www.bigdataspain.org/

Nathan Marz: “Cascalog: Making Data Processing Fun Again”
http://blip.tv/clojure/nathan-marz-cascalog-making-data-processing-fun-again-5970118

Data science in the natural sciences
http://strata.oreilly.com/2012/11/data-science-natural-sciences.html

Big Data To Drive $232 Billion In IT Spending Through 2016
http://techcrunch.com/2012/10/17/big-data-to-drive-232-billion-in-it-spending-through-2016/

Strata NYC 2012 and PyData
http://blogger.ghostweather.com/2012/11/strata-nyc-2012-and-pydata.html

Grok turns data into action
https://www.numenta.com/grok_info.html

Un nuevo canal para servir grandes cantidades de datos
http://www.datasalt.es/2012/10/una-nueva-caneria-para-servir-grandes-cantidades-de-datos/

Big Data @ Foursquare: Slides from our recent talk
http://engineering.foursquare.com/2011/03/24/big-data-foursquare-slides-from-our-recent-talk/

The Value of Values – Rich Hickey
http://jaxenter.com/the-value-of-values-rich-hickey-44872.html
Creator of Clojure and Datomic, Rich Hickey delivers this excellent JAXconf keynote about how the definition of values has changed in light of the increasing complexity of information technology and the advent of Big Data.

MapReduce and Its Discontents
http://www.infoq.com/presentations/MapReduce-Pregel-Storm
Dean Wampler discusses the strengths and weaknesses of MapReduce, and the newer variants for big data processing: Pregel and Storm.

7 new types of jobs created by Big Data
http://www.smartplanet.com/blog/bulletin/7-new-types-of-jobs-created-by-big-data/682

Meet the New Boss: Big Data
http://online.wsj.com/article/SB10000872396390443890304578006252019616768.html

The GPU “Sweet Spot” for Big Data
http://www.datanami.com/datanami/2012-09-11/the_gpu_sweet_spot_for_big_data.html

BigData Diagram
http://bigdata.globant.com/wp-content/uploads/2012/06/diagramSubBig.png

A beginners guide to streamed data from Twitter
http://mike.teczno.com/notes/streaming-data-from-twitter.html

My Links
http://delicious.com/ajlopez/bigdata

Keep tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

December 18, 2013

Big Data: Links, News And Resources (3)

Filed under: Big Data, Links — ajlopez @ 6:26 pm

Previous Post
Next Post

More links in my historical series:

Twitter vuelve a cambiar el acceso a su API y desafía su ecosistema
http://www.uberbin.net/archivos/web2-0/twitter-vuelve-a-cambiar-el-acceso-a-su-api-y-desafia-su-ecosistema.php

The Year Ahead In Big Data? Big, Cool, New Stuff Looms Large!
http://blogs.forrester.com/james_kobielus/11-12-19-the_year_ahead_in_big_data_big_cool_new_stuff_looms_large

Mike Stolz on NoSQL and Big Data Design Patterns
http://www.infoq.com/interviews/mike-stolz-nosql-and-big-data-design-patterns

Big Data Architectures at Facebook
http://www.infoq.com/presentations/Big-Data-Architectures-at-Facebook

chrisclark / PythonForDataScience
https://github.com/chrisclark/PythonForDataScience

Big data: extraer y visualizar grandes volúmenes de datos
http://www.meetup.com/HacksHackersBA/photos/9277872/131935032/

Adam & Greg Talk Storm, Big Data and Real Time Analytics with Dr. Matt
http://videomind.ooyala.com/blog/adam-greg-talk-storm-big-data-and-real-time-analytics-dr-matt

What is the Stratosphere System?
https://www.stratosphere.eu/

Big Data Architecture at LinkedIn
http://www.infoq.com/interviews/12-mar-sid-anand

Factual Releases Drivers that Matter: Python, Clojure, Haskell
http://blog.factual.com/factual-releases-drivers-that-matter-python-clojure-haskell

Just the Facts. Yes, All of Them.
http://www.nytimes.com/2012/03/25/business/factuals-gil-elbaz-wants-to-gather-the-data-universe.html

Big Data, Hadoop on Azure and the elephant in the room
http://blogs.southworks.net/ajezierski/2012/05/08/big-data-hadoop-on-azure-and-the-elephant-in-the-room/

R Is Not Enough For “Big Data”
http://www.forbes.com/sites/douglasmerrill/2012/05/01/r-is-not-enough-for-big-data/

Big Data Counting: How To Count A Billion Distinct Objects Using Only 1.5KB Of Memory
http://highscalability.com/blog/2012/4/5/big-data-counting-how-to-count-a-billion-distinct-objects-us.html

Big Data Week London – Hadoop Day
http://bigdataweek.com/big-data-week/big-data-uk/big-data-week-london/hadoop-day/

IBM doing Hadoop as a service in its cloud
http://gigaom.com/cloud/ibm-doing-hadoop-as-a-service-in-its-cloud/

Dremel: Interactive Analysis of Web-Scale Datasets
http://research.google.com/pubs/pub36632.html

Google BigQuery
https://developers.google.com/bigquery/
Use Google BigQuery to interactively analyze massive datasets — up to billions of rows.

How Twitter is doing its part to democratize big data
http://gigaom.com/cloud/how-twitter-is-doing-its-part-to-democratize-big-data/

HADOOP Enters the Enterprise Mainstream, and Big Data Will Never Be the Same
http://www.dbta.com/Articles/Editorial/Trends-and-Applications/HADOOP-Enters-the-Enterprise-Mainstream-and-Big-Data-Will-Never-Be-the-Same-81141.aspx

Big Data: Big Opportunities
http://www.emc.com/microsites/cio/articles/big-data-big-opportunities/index.htm

My Links
http://delicious.com/ajlopez/bigdata

Keep tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

December 14, 2013

Big Data: Links, News And Resources (2)

Filed under: Big Data, Links — ajlopez @ 5:44 pm

Previous Post
Next Post

What is big data?
http://radar.oreilly.com/2012/01/what-is-big-data.html
An introduction to the big data landscape.

Do we have the tools we need to navigate the New World of Data?
http://blogs.technet.com/b/dataplatforminsider/archive/2012/02/28/do-we-have-the-tools-we-need-to-navigate-the-new-world-of-data.aspx

Hadoop named as most popular big data source of 2011: report
http://www.zdnet.com/blog/btl/hadoop-named-as-most-popular-big-data-source-of-2011-report/70314

DataSift Architecture Overview
http://yfrog.com/z/nuuwzp

DataSift
http://datasift.com/
Social-data platform to enable enterprises and entrepreneurs to aggregate, filter and extract insights from Twitter in real-time

Big Data: Big Opportunities to Create Business Value
http://www.emc.com/microsites/cio/articles/big-data-big-opportunities/index.htm
http://www.emc.com/microsites/cio/index.htm

Oxford Internet Institute
Big Data Research Officer
http://www.oii.ox.ac.uk/people/newpositions/#p21

How to “crunch” your data stored in HDFS?
http://blog.octo.com/en/how-to-crunch-your-data-stored-in-hdfs/

STXXL: Standard Template Library for Extra Large Data Sets.
http://stxxl.sourceforge.net/
The core of STXXL is an implementation of the C++ standard template library STL for external memory (out-of-core) computations, i. e., STXXL implements containers and algorithms that can process huge volumes of data that only fit on disks.

Hadoop and NoSQL in a Big Data Environment
http://www.infoq.com/interviews/11-nov-ron-bodkin

Good Relationships
http://www.infoq.com/minibooks/good-relationships-spring-data
With Spring Data, the ever popular Spring Framework has cultivated a new patch of ground, bringing Big Data and NOSQL technology like Neo4j to enterprise developers.

Sorting 1PB with MapReduce
http://googleblog.blogspot.com/2008/11/sorting-1pb-with-mapreduce.html

Real-time feed processing with Storm
http://www.datasalt.com/2012/01/real-time-feed-processing-with-storm/
http://www.datasalt.es/2012/01/arquitectura-de-feeds-tiempo-real-con-storm/

DataSalt
http://www.datasalt.com/

Career of the Future: Data Scientist [INFOGRAPHIC]
http://mashable.com/2012/01/13/career-of-the-future-data-scientist-infographic/

The Business of  BIG DATA
http://www.slideshare.net/BenSiscovick/the-business-of-big-data-ia-ventures-v02

Brave New Big Data World
http://www.scoop.it/t/brave-new-big-data-world/

The King of Big Data
http://blogs.wsj.com/tech-europe/2011/11/07/the-king-of-big-data/?mod=google_news_blog
One of the next big things that enterprises need to understand is Big Data, but the demands Big Data makes require a different kind of way of looking at data.

Big crime meets big data
http://radar.oreilly.com/2011/12/marc-goodman-data-crime.html
Data and social media are being used against us in creative new ways.

Machine learning for dummies
http://blogs.technet.com/b/next/archive/2011/02/16/machine-learning-for-dummies-john-platt.aspx

Embracing Uncertainty
http://embracinguncertainty.info/
http://scpro.streamuk.com/uk/player/Default.aspx?wid=7739
The new machine intelligent

6 Big HealthTech Ideas That Will Change Medicine In 2012
http://techcrunch.com/2012/01/01/healthtech-2012/
Artificial Intelligence, Big Data …

Zenzey
http://beta.zenzey.com/site/

The feedback economy
http://radar.oreilly.com/2012/01/the-feedback-economy.html
Companies that employ data feedback loops are poised to dominate their industries.

Cloudera puts the Hadoop in Oracle’s Big Data Appliance
http://gigaom.com/cloud/cloudera-brings-the-hadoop-to-oracles-big-data-appliance/

My Links
http://delicious.com/ajlopez/bigdata

Keep tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

June 12, 2013

Big Data: Links, News And Resources (1)

Filed under: Big Data, Links — ajlopez @ 1:48 pm

Next Post

My first links about this topic:

http://en.wikipedia.org/wiki/Big_data

Big data[1][2] is a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. The challenges include capture, curation, storage,[3] search, sharing, transfer, analysis,[4] and visualization. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as compared to separate smaller sets with the same total amount of data, allowing correlations to be found to “spot business trends, determine quality of research, prevent diseases, link legal citations, combat crime, and determine real-time roadway traffic conditions.”[5][6][7]

As of 2012, limits on the size of data sets that are feasible to process in a reasonable amount of time were on the order of exabytes of data.[8][9] Scientists regularly encounter limitations due to large data sets in many areas, including meteorology, genomics,[10] connectomics, complex physics simulations,[11] and biological and environmental research.[12] The limitations also affect Internet search, finance andbusiness informatics. Data sets grow in size in part because they are increasingly being gathered by ubiquitous information-sensing mobile devices, aerial sensory technologies (remote sensing), software logs, cameras, microphones, radio-frequency identification readers, andwireless sensor networks.[13][14] The world’s technological per-capita capacity to store information has roughly doubled every 40 months since the 1980s;[15] as of 2012, every day 2.5 quintillion (2.5×1018) bytes of data were created.[16] The challenge for large enterprises is determining who should own big data initiatives that straddle the entire organization.[17]

Big data is difficult to work with using most relational database management systems and desktop statistics and visualization packages, requiring instead “massively parallel software running on tens, hundreds, or even thousands of servers”.[18] What is considered “big data” varies depending on the capabilities of the organization managing the set, and on the capabilities of the applications that are traditionally used to process and analyze the data set in its domain. “For some organizations, facing hundreds of gigabytes of data for the first time may trigger a need to reconsider data management options. For others, it may take tens or hundreds of terabytes before data size becomes a significant consideration.”[19]

What is big data?
http://www-01.ibm.com/software/data/bigdata/

El desafío del “big data”, más que sólo grandes volúmenes de datos
http://tecno.americaeconomia.com/noticias/el-desafio-del-big-data-mas-que-solo-grandes-volumenes-de-datos

http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
Big data: The next frontier for innovation, competition, and productivity

Big Data
http://www.emc.com/microsites/bigdata/index.htm

Data Science Summit
http://www.greenplum.com/datasciencesummit/

Big Data: Evolution or Revolution?
http://www.infoq.com/news/2011/11/bigdata

DataSift Using MySQL, HBase, Memcached to Deal With Twitter Firehose
http://nosql.mypopescu.com/post/13540746376/datasift-using-mysql-hbase-memcached-to-deal-with

DataSift Architecture: Realtime Datamining At 120,000 Tweets Per Second
http://highscalability.com/blog/2011/11/29/datasift-architecture-realtime-datamining-at-120000-tweets-p.html

Explaining Hadoop to Your CEO
http://www.forbes.com/sites/danwoods/2011/11/03/explaining-hadoop-to-your-ceo/

The World’s Technological Capacity to Store, Communicate, and Compute Information
http://www.sciencemag.org/content/early/2011/02/09/science.1200970

The Big Data Boom Is the Innovation Story of Our Time
http://www.theatlantic.com/business/archive/2011/11/the-big-data-boom-is-the-innovation-story-of-our-time/248215/

MongoDB Intro & Application for Big Data
http://www.slideshare.net/doryokujin/mongodb-intro-application-for-big-data

The Big Data Bottleneck In The Consumer Web
http://techcrunch.com/2011/11/21/the-big-data-bottleneck-in-the-consumer-web/

Microsoft drops Dryad; puts its big-data bets on Hadoop
http://www.zdnet.com/blog/microsoft/microsoft-drops-dryad-puts-its-big-data-bets-on-hadoop/11226

Distributed Cache as a NoSQL Data Store?
http://www.infoq.com/news/2011/11/distributed-cache-nosql-data-sto

Building Scalable Systems: an Asynchronous Approach
http://www.infoq.com/presentations/Building-Scalable-Systems-Asynchronous-Approach

Big Data Intelligence on Hadoop
http://karmasphere.com/

Ville Tuulos on Big Data and Map/Reduce in Erlang and Python with Disco
http://www.infoq.com/interviews/tuulos-erlang-mapreduce

The elephant in the room … Hadoop and BigData!
http://mikethetechie.com/post/6822576191/the-elephant-in-the-room-hadoop-and-bigdata

Is Microsoft’s Future in Data-as-a-Service?
http://www.readwriteweb.com/cloud/2011/05/is-microsofts-future-in-its-ap.php

Resolving the contradictions between web services, clouds, and open source
http://radar.oreilly.com/2010/12/what-are-the-chances-for-a-fre.html

Strata Gems: Where to find data
http://radar.oreilly.com/2010/12/where-to-find-data.html

Big crime meets big data
http://radar.oreilly.com/2011/12/marc-goodman-data-crime.html
Data and social media are being used against us in creative new ways.

Tech Talk: Nathan Marz — “Clojure at BackType”
http://sna-projects.com/blog/2010/11/clojure-at-backtype/

Do We Need a New Programming Language for Big Data?
http://www.theopenforce.com/2010/09/do-we-programming-language-big-data.html

My Links
http://delicious.com/ajlopez/bigdata

Keep tuned!

Angel “Java” Lopez
http://www.ajlopez.com
http://twitter.com/ajlopez

Blog at WordPress.com.