Disclaimer: The individual owning this blog works at Sun Microsystems GmbH in Germany, a subsidiary of Oracle. The opinions expressed here are his own, are not necessarily reviewed in advance by anyone but the individual author, and neither Oracle nor any other party necessarily agrees with them.
Digging into Apache Hadoop
Thursday, October 2, 2008
Comments
Have you seen the OpenSolaris/Hadoop live CD? http://opensolaris.org/os/project/livehadoop/
Yes, I saw it, but I want to integrate it into my already running testbed.
(Your spam prevention is preventing valid comments, BTW. It took me way too many tries to get around it.)
I'm not sure I understand what you mean about not being able to work with compressed files. The documentation at hadoop.apache.org/core/docs/current/native_libraries.html describes how to turn on native compression in Hadoop so that it can read/use gzip, lzo, etc, compression as part of the MR job.
Sorry for the hassle with the spam prevention, but with a less stringent filter I would spend my free time deleting spam ...
I read in a presentation that you can't split compressed files into shards. That sounded logical, as you can't take 10 MB out of the middle of a gz file and gunzip it. I have to admit that I'm in the early stages of digging into Hadoop. I will dig further into the documentation ...
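The splitting problem described above can be reproduced with a few lines of Python. This is only a sketch using the standard `gzip` module, not anything Hadoop-specific; the payload and the helper name `can_decompress` are made up for illustration. It compresses a block of records as one gzip stream and then tries to decompress a slice taken from the middle of that stream.

```python
import gzip
import zlib

# Illustrative payload: 20,000 short text records, compressed as ONE gzip stream.
payload = b"".join(b"record %06d\n" % i for i in range(20000))
compressed = gzip.compress(payload)


def can_decompress(chunk: bytes) -> bool:
    """Return True if `chunk` can be decoded as a gzip stream on its own."""
    try:
        gzip.decompress(chunk)
        return True
    except (OSError, EOFError, zlib.error):
        # A slice from the middle lacks the gzip header and the
        # decompressor state built up from the preceding bytes.
        return False


# The complete stream decodes; the second half on its own does not.
middle = compressed[len(compressed) // 2:]
whole_ok = can_decompress(compressed)
middle_ok = can_decompress(middle)
print("whole stream:", whole_ok, "| middle slice:", middle_ok)
```

A worker that is handed only a byte range of such a file has nothing to start from, which is why a single gzip file is normally processed as one unit rather than in shards.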
You might be interested in Hive, Facebook's "alternative" to HBase, as well - it seems to provide a better interface (SQL-like, rather than the HBase shell).
Also, the way we got around the compression issue (our files were tarred and gzipped) was to extract and resize the archives on the fly -- to meet the shard size I believe. I wasn't personally involved in that part, so you'll have to forgive me if I've got it wrong!
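One way to read "extract and resize" is to decompress the archive once and write it back out as several smaller, independently compressed pieces. The sketch below is an assumption about the general idea, not the commenter's actual tooling: it handles a plain gzip stream of line-oriented records (no tar handling), and the function name `rechunk_gzip` and the `target_size` parameter are hypothetical.

```python
import gzip
import io
from typing import List


def rechunk_gzip(source: bytes, target_size: int) -> List[bytes]:
    """Split one gzip stream into independently compressed, line-aligned pieces.

    Each piece holds at least `target_size` uncompressed bytes,
    except possibly the last one.
    """
    pieces = []
    buffer = []
    buffered = 0
    with gzip.GzipFile(fileobj=io.BytesIO(source)) as stream:
        for line in stream:
            buffer.append(line)
            buffered += len(line)
            if buffered >= target_size:
                # Flush a complete piece; records are never cut in half.
                pieces.append(gzip.compress(b"".join(buffer)))
                buffer, buffered = [], 0
    if buffer:
        pieces.append(gzip.compress(b"".join(buffer)))
    return pieces


# Illustrative data: 1,000 records of 14 bytes each, re-chunked at about 4 KiB.
original = b"".join(b"record %06d\n" % i for i in range(1000))
pieces = rechunk_gzip(gzip.compress(original), target_size=4096)
restored = b"".join(gzip.decompress(p) for p in pieces)
print(len(pieces), "pieces, round trip ok:", restored == original)
```

Because every piece is a complete gzip stream, each one can be decompressed without the others, so the pieces can be handed to separate workers.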
Check out CloudBase: http://cloudbase.sourceforge.net
It is a data warehouse system built on top of Hadoop's MapReduce architecture that allows one to query terabytes and petabytes of data using ANSI SQL. It comes with a JDBC driver, so third-party BI tools and reporting frameworks can connect directly to CloudBase. CloudBase creates a database system directly on flat files and converts ANSI SQL expressions into map-reduce programs that process those files. It has an optimized algorithm for handling joins, and table indexing is planned for the next release.
This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 Germany License.