Thursday, March 22, 2012

Stack Exchange Data Explorer and Big Data

There are a lot of properties under the StackExchange banner.  Here is a tool that lets you run SQL-style queries against them all.

Stack Exchange Data Explorer

Friday, March 16, 2012

Installing hadoop development cluster on Windows and Eclipse -- Introduction

Hadoop & HDFS on Windows here for all the non-Linux types.  Of course, this is missing all the Hive/Pig/etc goodness and the Cloudera configuration.

This tutorial describes how to set up such a cluster on a computer running Microsoft Windows. It also describes how to integrate this cluster with Eclipse, a prime Java development environment.

Installing hadoop development cluster on Windows and Eclipse -- Introduction

Wednesday, March 14, 2012

Java interop with .NET

To wire up those cloud apps that are written using Java, use some translation tools.

IKVM.NET is useful for several different software development scenarios. Here is a sampling of some of the possibilities.

Uses for IKVM.NET

Monday, March 12, 2012

How-to install R on CentOS « /home/kOoLiNuS

Once you get your Hadoop cluster setup, you’ll probably want R.  Here’s how to install it.

A not so christmas-connected topid today as I’m going to write down a few lines on how-to install the open source statistical tool and relative developement libraries R on CentOS release 4 and 5.

How-to install R on CentOS « /home/kOoLiNuS

CentOS 5 - Networking Support - Setting up a folder share using Likewise over an Active Directory Network

You may want to access your CentOS VM using a file share at one point.  Here’s how.

I'm making a shift from Ubuntu to CentOS 5. Things are obviously quite a bit different between the two, as I've discovered. I'd appreciate some input as I make the shift.
The first problem (and hopefully easiest) is designating a folder to be shared. In Ubuntu it was as easy as right-clicking a folder and click Share Folder. I need to learn what the method is for CentOS, either via the GUI or otherwise.

www.centos.org - Forums - CentOS 5 - Networking Support - Setting up a folder share using Likewise over an Active Directory Network

Cloudera–VM Setup to spin up some Cloud Computing Power

Cloudera provides a demo VM for download, to try out Hadoop, Hive, and those other friendly Zoo animals.

https://ccp.cloudera.com/display/SUPPORT/Cloudera's+Hadoop+Demo+VM#Cloudera%27sHadoopDemoVM-RunningtheVM

If you’re trying to get the VM setup on Windows in Oracle VirtualBox

With the VM powered off, click on ... the VM Name->Details-tab>General-> Advanced tab .. and select the checkbox "Enable IO APIC".
Power on the VM.. and that should at least get you and running.

www.centos.org - Forums - CentOS 5 - Miscellaneous Questions - Installing CentOS 5 in VirtualBox

Setup Virtualbox to run in Bridged Network mode

Fix permissions to run Hadoop jobs:

cd /var/lib/hadoop-0.20/cache/cloudera/
chmod 777 .

Hop right into Beehive with Cloudera/Cloudera userid/pw and try out the samples.

http://localhost:50070 

Try out Hue

http://localhost:8088

Setting up the stuff, end-to-end

http://gbif.blogspot.com/2011/01/setting-up-hadoop-cluster-part-1-manual.html

Changing the terminal window colors in VI (because blue directories hurt my eyes)

http://www.cs.colostate.edu/helpdocs/vi.html
http://ricochen.wordpress.com/2011/05/29/change-centos-ssh-shell-color/

See the IP address of the machine
http://www.cyberciti.biz/faq/how-to-find-out-the-ip-address-assigned-to-eth0-and-display-ip-only/

Monday, March 5, 2012

RenderMan in the cloud

I remember working on some rendered video back in the 90s. A single 30 second scene could take days to render at 25fps. This should bring the time down quite a lot.

Pixar's RenderMan® / Press Releases: "New cloud rendering service available in collaboration with GreenButton®

EMERYVILLE, CA – (January 19th, 2012) Pixar Animation Studios today launched a new cloud rendering service called "RenderMan® On Demand™." Developed in collaboration with pioneering cloud services provider GreenButton, this service is now available on Microsoft's Windows® Azure with Linux availability following later in 2012. "RenderMan On Demand" offers nearly instantaneous burst rendering access to hundreds and even thousands of additional cores, enabling artists and studios to deliver the highest-quality cinematic imagery with unprecedented economy and efficiency."


'via Blog this'