MySQL DBAs know that integrating MySQL and a big data solution can be challenging. That’s why I invite you to join me this Wednesday (Oct. 2) at 10 a.m. Pacific time for a free webinar in which I’ll walk you through how to implement a successful big data strategy with Apache Hadoop and MySQL. This webinar is tailored for MySQL DBAs and developers (or anyone with previous MySQL experience) who want to know how to use Apache Hadoop together with MySQL for big data.
The webinar is titled “Implementing MySQL and Hadoop for Big Data,” and you can register here.
Storing Big Data in MySQL alone can be challenging:
Apache Hadoop together with MySQL can solve many big data challenges. In the webinar I will present:
I look forward to the webinar and hope to see you there! If you have questions in advance, please ask them below.
Hi Alexandar,
I really liked your talk on Sqoop. Is there a tutorial you would recommend for MySQL and Sqoop integration?
I’d really like to try moving data that is in MySQL over to Hadoop and analyzing it there.
This is a link to the sqoop documentation: http://sqoop.apache.org/docs/1.99.2/
Hi,
I tried loading data from a MySQL database into a Hadoop cluster (EMR) using Sqoop and it worked fine, but when I try to use Hive to load that data into a table in S3, I get this error:
hive> LOAD DATA INPATH '/user/hadoop/wordcount/part-m-0000*' INTO TABLE wordcount2;
FAILED: SemanticException [Error 10028]: Line 1:17 Path is not legal ''/user/hadoop/wordcount/part-m-0000*'': Move from: hdfs://10.xx.xx.xx:9000/user/hadoop/wordcount/part-m-0000* to: s3://iform-dev-s3-bucket-1/samples is not valid. Please check that values for params "default.fs.name" and "hive.metastore.warehouse.dir" do not conflict.
I think you mentioned during the presentation that it was possible to store data directly in S3 using Sqoop with EMR?
Am I doing something incorrectly?
Hi Mandm,
Please try /user/hadoop/wordcount/ instead of /user/hadoop/wordcount/part-m-0000*. Hive takes a “directory” name. You can also use hadoop fs -cp to copy the files from HDFS into S3 (or hadoop distcp, which runs the copy as a MapReduce job) and then create an external table in Hive on top of that S3 location.
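The copy-then-external-table route from the reply above can be sketched roughly as follows. This is only an illustration, not a tested EMR recipe: the bucket name `example-bucket`, the table name `wordcount_s3`, the two-column schema, and the comma delimiter (Sqoop's default for text output) are all assumptions, and the helper functions are hypothetical. The script only builds and prints the commands so you can review them before running anything.

```python
import shlex


def build_copy_command(hdfs_dir, s3_dir):
    """Return the argv list that copies an HDFS directory into S3."""
    return ["hadoop", "fs", "-cp", hdfs_dir, s3_dir]


def build_external_table_ddl(table, columns, s3_dir, delimiter=","):
    """Return HiveQL that defines an external table over an S3 directory.

    `columns` is a list of (name, hive_type) pairs. The schema is an
    assumption here; it has to match the files Sqoop actually wrote.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols}) "
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}' "
        f"LOCATION '{s3_dir}'"
    )


if __name__ == "__main__":
    # Hypothetical locations; replace with your own paths.
    hdfs_dir = "/user/hadoop/wordcount"
    s3_dir = "s3://example-bucket/wordcount/"

    copy_cmd = build_copy_command(hdfs_dir, s3_dir)
    ddl = build_external_table_ddl(
        "wordcount_s3",
        [("word", "STRING"), ("cnt", "INT")],  # assumed schema
        s3_dir,
    )

    # Print the shell commands instead of executing them.
    print(shlex.join(copy_cmd))
    print(shlex.join(["hive", "-e", ddl]))
```

Because the table is external, Hive reads the files where they already sit in S3, so no LOAD DATA move between HDFS and S3 is needed.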
Hi,
Nice article about big data and Hadoop. With technology this advanced, new tools are very important for handling big data problems, and Apache Hadoop is an important concept to learn. I haven’t seen your webinar, but after going through your blog I would like to watch it so that I can also learn something from your online Hadoop training courses. Hadoop is quite complex and challenging, but it can lead to a new career in big data.
Thanks for the post.
Your time and effort are much appreciated.