Quick tips for using Big Data for your growing business
Contratar Big Data Salespeople
You are given 2 CSV data sets: (a) A course dataset containing details of courses offered (b) A job description dataset containing a list of job descriptions (Note: Each field of a job description record is demarcated by " ") You have to design and implement a distributed recommendation system using the data sets, which will recommend the best courses for up-skilling based on a given job description. You can use the data set to train the system and pick some job descriptions not in the training set to test. It is left up to you how you pick necessary features and build the training that creates matching courses for job profiles.
We are looking for someone to supply us with a list of UK-based Hospital Doctors and Nurses. This data must have the candidate's name, GMC number, speciality, and hospital they are at, also sub specialities if possible, email address, phone number, also personal address if avalible.
We need you to use CRM software to identify and fix our user's stickiness, churn rate, DAU, data analytics, etc. Which CRM software are you using ? How are you going to fix our DAU ? How much do you want for this job ? Our app is travpart in google. Please let me knows what/which data you need to improve our DAU ? Our goal is to increase DAU and conversion rate for sale, How do I know you can do this ?
The write-up should include the main problem that can be subdivided into 3 or 4 subproblems. If I'm satisfied, we discuss further on implementation.
Problem Statement: Design scalable pipeline using spark to read customer review from s3 bucket and store it into HDFS. Schedule your pipeline to run iteratively after each hour. Create a folder in the s3 bucket where customer reviews in json format can be uploaded. The Scheduled big data pipeline will be triggered manually or automatically to read data from The S3 bucket and dump it into HDFS. Use Spark Machine learning to perform sentiment analysis using customer review stores in HDFS. Data: You can use any customer review data from online sources such as UCI
build a real-time sales dashboard for a Shopify website. half of the project is done just need to load data in was athena from s3 and then connect it to quick sight.
2+ years experience of Python coding skills with Flask framework or Fast API ● 2+ years experience in Python scripting. ● Solid database skills in open source databases (i.e. MongoDB, MySQL, etc.) ● Solid knowledge in writing Restful Secure APIs ● Good understanding of python frameworks & libraries - Familiarity with some ORM (Object Relational Mapper) libraries ● Good understanding of micro service architecture, and how to configure and deploy complex containerized applications ● Knowledge of user authentication and authorization between multiple systems, servers, and environments (JWT Authentication) ● MUST have experience to design, code, test, and analyze Python applications leveraging RDBMS (MySQL & MS SQL SERVER databases). ● Build and deployment process knowledge
I need to do a text mining on phyton. im doing a literature review - analyzing what academicians wrote on a certain topic over the years. my data set is around 1500 entries. I need text mining + topic modeling (lda). I will be guiding this guy on the steps i have 1142 data to be precise the research questions are these: How did research on gamification in tourism evolved? How is the academic research on gamification in tourism is distributed among geographical domains? What are the most commonly used keywords? Whom are the top authors researching this topic? Who are the co-authors that commonly research gamification in tourism? Which institutions publish the highest number of papers? What are the sources of the most cited publications? What are the individual fields in the references of...
What I am looking for is to identify companies that have taken (2) loans, one in both 2020 and 2021. I am further looking to sort the data by at least NAICS code (industry) (column AM) and State (column M) I want to be able to visualize and sort the data by who was most impacted, and that will be determined by the amount of jobs lost / gained 2021 vs 2020. We can discuss further visualization (I need your guidance) but this is a minimum. We know if they have taken (2) loans if they have the same company name in 2021 however if the company only shows up once, it is 2020 so they've taken (1) loan and are not of interest to us. The total data set is under a million records.