How do I contribute to open source projects like Hadoop and HBase without access to large clusters?
-
I've been learning about open source projects like Hadoop & HBase, and they seem like a lot of fun to work on. I'd like to learn about these technologies more in-depth (and in general learn more about distributed systems) but do not work in a company that uses these projects and don't have access to large clusters of machines. What would be the best way to get involved? Thanks!
-
Answer:
For learning Hadoop and HBase, you can setup a pseudo distributed cluster on your local machine and use that. You can read up Tom White's Hadoop: The Definitive Guide[1] and Lars George's HBase: The Definitive Guide[2] and try the sample programs. Jason Venners Pro Hadoop[3] is an excellent resource too. The Hadoop[4] and HBase wiki's[5] have excellent documentation too, as does Cloudera's training videos[6] and blog.[7] The blog also has many use cases and practical applications of Hadoop and related technologies. If you want to try your programs on a bigger cluster, you can try the cloud providers like AWS or GoGrid and set up a cluster there. You could also use Amazon's Elastic Map Reduce service. When you are ready to contribute, you can start answering questions on the mailing lists, submit patches, provide documentation and even code features. In an open source project, its always as much as you want to do! If you want, you can also contribute to open source applications and tools built for Hadoop and HBase. If you would like to contribute to Crux Reporting for HBase [8] or HIHO framework for Hadoop [9], let me know. [1]: http://www.amazon.com/Hadoop-Definitive-Guide-Tom-White/dp/0596521979 [2]: http://shop.oreilly.com/product/0636920014348.do [3]:http://www.amazon.com/Hadoop-Experts-Voice-Open-Source/dp/1430219424/ref=sr_1_1?s=books&ie=UTF8&qid=1315592454&sr=1-1 [4]: http://hadoop.apache.org/ [5]: http://hbase.apache.org/ [6]: https://ccp.cloudera.com/display/SUPPORT/Learn+Hadoop [7]: http://www.cloudera.com/blog/ [8]: https://github.com/sonalgoyal/crux [9]: https://github.com/sonalgoyal/hiho
Sonal Goyal at Quora Visit the source
Other answers
You can also have a look at Fedora BigData SIG [1] that has come up relatively recently. There's also a GitHub repo. maintained for the same [2]. [1] - https://fedoraproject.org/wiki/SIGs/bigdata?rd=Big_data_SIG [2] - https://github.com/fedora-bigdata
Aditya Bhardwaj
Related Q & A:
- Are there any open source projects for building websites like elance?Best solution by Stack Overflow
- How do i get an email alert sound like aol "you've got mail?Best solution by Yahoo! Answers
- How can I change what my avatar looks like?Best solution by Yahoo! Answers
- How come i can't open my yahoo messenger?Best solution by Yahoo! Answers
- How do i edit a picture to be like this?Best solution by Yahoo! Answers
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.