Find Jobs
Hire Freelancers

Looking for a Hadoop Spark / Redshift Developer

min $50 USD / hour

Zavřený
Zveřejněno před více než 6 roky

min $50 USD / hour

1)Please provide one (small / medium ) use case of your Hadoop ETL work in detail. 2)f there are some X number of customers and they made some purchases. Can you write an SQL to find out the TOP 5 customers who made the most purchases. 3)What did you use Map Reduce for? What did you use PIG for ? What did you use Hive for ? where does the data gets transformed? / While performing transformations where is the data? -------------------------------------------------- 4)Please provide one (small / medium ) use case of your work with Spark in detail -------------------------------------------------- 5)Please provide one (small / medium ) use case of your work with Redshift in detail. 6)Did you perform the ETL work? where does the data gets transformed? / While performing transformations where is the data? Or Did you simply load the data from Source to Redshift? 7)What are your Data Sources? How did you get the Data from Source to the AWS environment? 8)Did you use Python? If yes, What was the purpose you use Python for? Explain in Detail.
IČ projektu: 15239597

O projektu

17 nabídky
Vzdálený projekt
Aktivní před 6 roky

Chcete si vydělat nějaké peníze?

Výhody podávání nabídek na Freelancer

Stanovte si rozpočet a časový rámec
Získejte za svou práci zaplaceno
Načrtněte svůj návrh
Registrace a podávání nabídek je zdarma
17 freelanceři nabízejí v průměru $58 USD/hodinu za tuto práci
Avatar uživatele
Hi, My name is Benjamin. I'm an expert with over 14 years of experience. I have worked primarily with Spark for ETL. The data sources were Amazon S3 and REST APIs, formats being Parquet, JSON or CSV. 1 -> The case was to take transaction data and glean BI views 2 -> Yes, this should be easy once we have the data, with group by and sums 3 -> Data transformation happens on the cluster 4 -> Did a process automation for Machine Learning, apart from a few ETLs 5 -> I have experience loading data into RedShift for visualization. Would love to work with you on this. Look forward to hearing from you. Regards,
$55 USD v 40 dnech
5,0 (6 recenze)
6,0
6,0
Avatar uživatele
I have extensive experience working on Hadoop ecosystem: Hive, Spark, Sqoop, HBase, Redshift, Oozie, Storm, Impala, Kylin etc. Also, MongoDB, Cassandra, Spark SQL, Spark ML lib, Spark Streaming Q1. Please provide one (small / medium ) use case of your ETL work in detail. 1. Web Scrappers -> Kafka -> Elasticsearch -> Kibana [ 10^7 logs in 1PB data per day] 2. Twitter -> Python Producers -> Pyspark EMR -> Elasticsearch + Redshift -> Tableau [1GB per day] 3. Radio API -> Kinesis -> Hive Map Reduce -> MySQL [2 GB per day] 4. Web API + Mobile API -> Kafka -> Python Consumers -> Teradata -> Power BI [4 GB per day] less Q2. If there are some X number of customers and they made some purchases. Can you write an SQL to find out the TOP 5 customers who made the most purchases. SELECT top 5 custid, COUNT(distinct orderid) AS 'Purchases' FROM orders GROUP BY custid ORDER BY 2 DESC Q3. What did you use Map Reduce for? What did you use PIG for ? What did you use Hive for ? where does the data gets transformed? / While performing transformations where is the data? Map Reduce is a framework, it was used in the use case #3 above. I have not worked on PIG but understand how it is different from Hive. I have used Hive in use case #3 above. During Map Reduce the data is mapped (read) from a storage (usually HDFS or S3 bucket) and Reduced (transformed, aggregated etc.) using Hive Storage. On the other hand, if we do the same in Spark, it happens in memory.
$55 USD v 40 dnech
5,0 (6 recenze)
5,6
5,6
Avatar uživatele
Hi, Its quite unfortunate that we cannot answer the questions you have raised in an attachment and post as freelancer.com never allows a bidder to attach a file until the client responses once at least. We have highly skilled developer on Hadoop and Spark. If you want I can patch you to talk so that you may judge him the technical skills. I will discuss with you the required commercials. Agree? Please response and make a schedule. Let’s discuss, Reasons to choose us: ****************** 1. We are a Govt, registered company named Eclipse Technoconsulting Global Pvt. Ltd 2. We are the proud company to get Global quality appraisal as CMMi level 3. 3. We are ISO 9001:2008 Certified company. Regards, Mit,
$98 USD v 40 dnech
5,0 (1 recenze)
5,0
5,0
Avatar uživatele
Hi, I have experience in spark/hadoop and machine learning. for more information ping me. I will provide all details.
$55 USD v 40 dnech
5,0 (1 recenze)
3,1
3,1
Avatar uživatele
Hi, I’m a Web Designer/Developer from the UK. My name is Mike. Your project description sounds interesting to me and I do have skills & experience that are required to complete this project. Let's have a quick chat when you're online.
$55 USD v 40 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
We are a Team of Data Scientists having healthy experience into Big Data technologies like Hadoop,MapReduce and Data Analytics like R,HBase etc. The Team has qualified engineers having expertise in solving complex problems.
$56 USD v 83 dnech
0,0 (1 recenze)
0,0
0,0
Avatar uživatele
I have strong technical and data analytical skills. Have 2 yrs of experience in Data analytics and data engineering. Hard working.
$50 USD v 10 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I am Big Data expert with over 14 years of experience. Among that over 5 years in Big Data Technologies. I have worked all the mentioned technologies, except RedShift.
$77 USD v 20 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hi plz discuss with me I am here to work on hourly basis plz I am waiting ur reply I will discuss with u point by point I am not known that aboutique first point but understood restate of plz message to discuss with u
$55 USD v 40 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Dear Client, I am an expert data analytics having 5 year of experience. I have been worked with big companies like Careem, OLACAB, TCS as well before as in my past experience. I have excellent command on R language, Hadoop, Bigdata tools etc. Thanks Prateek
$55 USD v 40 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
A proposal has not yet been provided
$55 USD v 30 dnech
0,0 (0 recenze)
0,0
0,0

O klientovi

Pochází z UNITED STATES
United States
0,0
0
Členem od zář 23, 2017

Ověření klienta

Díky! Poslali jsme vám e-mailem odkaz pro získání kreditu zdarma.
Při odesílání e-mailu se něco pokazilo. Zkuste to prosím znovu.
Registrovaných uživatelů Zveřejněných projektů
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Načítání náhledu
Bylo uděleno povolení ke geolokaci.
Vaše doba přihlášení vypršela a byli jste odhlášeni. Přihlaste se znovu.