Pyspark mllib jobs

Filter

My recent searches
Filter by:
Budget
to
to
to
Type
Skills
Languages
    Job State
    930 pyspark mllib jobs found, pricing in USD

    I Have a Hadoop Project, I have installed VMware and also Ubuntu, I need an expert to develop code for the sample given in the pdf chaper 11 and run and execute it. Also needs to have additional new features in it.

    $356 (Avg Bid)
    Urgent
    $356 Avg Bid
    5 bids

    I require someone with pyspark knowledge to produce a movie recommendation script in python 3+. The script should be able to run locally on a mac. I have pyspark installed and functional so I will be able to test once you have build the script. Attached are specification for the project and an faq. YOU are only require to implement Workload 2 (that is a simple neighborhood based on collaborative filtering algorithm for personalized recommendation) Key requirement is that the script should be completed by Wednesday 25th 2016 by 9 pm (sydney australia) time. So this project will be rewarded very quickly to the right candidate. Please state your experience with pyspark and python. Data for the project is available download from the following location

    $931 (Avg Bid)
    $931 Avg Bid
    5 bids

    ...mixed categorical and numeric data, attempt to build a model, report on whether the model is more predictive than chance, and then apply the model to new data. I want this for as many platforms as possible, including: MLpack, mxnet, , torch, keras, theano/tensorflow, dlib, vowpal wabbit, caffe, xgboost, cntk, scikit-learn (with all its various modes), aerosolve. smile, h2o, weka, spark mllib, deeplearning4j, , , anything unmentioned in python or r. Assume no more than 256 columns, assume columns may be either floating point or text fields that may be hashed into a unique identifier. You don't have to document installation. Assume prereqs are there, I just want to know how to apply to data. You're welcome to document knobs but they're not required. This ca...

    $707 (Avg Bid)
    $707 Avg Bid
    5 bids

    I have many small sized audio files in the cluster which I need to create 10 sec spectrogram of it . I need someone who can do using Pyspark or Hadoop streaming(Python) .

    $21 (Avg Bid)
    $21 Avg Bid
    1 bids

    Seeking Big Data Hadoop Experts in Hortonworks HDP/ Apache Spark, Java + 4 or more years of hands-on experience + 5 Star Rating + 2000 hours of experience + excellent communication in english + available in US EST time on Skype ASAP. Prior work samples required. Serious Candidates who can sign NDA please apply.

    $632 (Avg Bid)
    $632 Avg Bid
    13 bids

    Descripción de empleo<br />This is a seed requisition. We are staffing several teams, and different set of skills are required. We are looking for software engineers ...prototyping initiatives <br /><br />Qualifications<br /><br />You should possess a Bachelor of Science degree in Computer Science and/or Computer Engineering or equivalent degree. <br />Additional qualifications include: <br />-2+ years of working experience developing software with Python, Java or Scala <br />-Experience with SQL and/or NoSQL databases <br />-Experience with Spark streaming and MLlib (or willingness to learn) <br />-Knowledge on Machine Learning basics (or willingness to learn) <br />-Practice...

    $1 (Avg Bid)
    $1 Avg Bid
    1 bids

    I have already did the coding in PySpark for text classification, my data base is like {label = 3, text = "I like this product"}, {label = 1, text = "I don't like this product"}, {label = 5, text= 'very good'}..., Basically 5 label (class) and text based on text we need to predict the label using SVM/RandomForestClassifier/DecisionTree classifier in PySpark only. if not possible then in Scala using Spark. I have attached the my code and if you are able to correct it then I will assign to you and then we will work together further. Thanks

    $27 (Avg Bid)
    $27 Avg Bid
    2 bids

    I need you to develop some software for me. A PySpark program implementing machine learning concepts.

    $28 (Avg Bid)
    $28 Avg Bid
    4 bids

    I have a Spark application (coded with Scala) running on a cluster of 5 nodes. Each node has 125GB memory and 32 cores. There is a function/method containing 8000 loops totally. Inside the loop it calls MLlib k-Means, so the k-Means runs about 8000 thousand times. The running time of the big loop is around 5.5 hours currently on the Spark clusters. I want to reduce the running time significantly. May need to tune JVM parameters, Spark configuration parameters, or restructure the Scala application code, etc.

    $348 (Avg Bid)
    $348 Avg Bid
    3 bids

    Our company is building a highly innovative Business Intelligence platform. We require someone with expertise in Machine Learning with previous experience of building and deploying machine learning algorithms based in python and mllib via Apache Spark.  The ideal candidate will help us with a number of tasks including the following;  Understand our business requirements and help design, build and deploy machine learning algorithms to production.  Maintain existing python based algorithms and also assist in migration to mllib.  Suggest new ways that Machine Learning can be used to provide more value.  You will be working with a world class engineering team and with a number of high profile clients on challenging and interesting projects. A very g...

    $820 (Avg Bid)
    $820 Avg Bid
    9 bids

    I need you to develop some software for me using a piece of code in pySpark to do analysis on data. should utilize parquet files.

    $137 (Avg Bid)
    $137 Avg Bid
    3 bids

    We currently use SQL Server to store our data in the cloud. However, we would like to advantage of some of the other tools available namely Spark. We have downloaded and installed Python, Spark, Java, and Hadoop (however, this does not imply we have done it correctly). We wa... and Hadoop (however, this does not imply we have done it correctly). We want to take advantage of the distributed nature of Spark, ideally taking advantage of Mesos for resource management. We want to be sure and connect the python instance to IPyton/Jupyter for our purposes. We are looking for someone who can use Teamveiwer so we can document the process as you setup a fully functioning pyspark environment. Success will be measured by us being able to achieve 1 or 2 queries using what has been de...

    $31 (Avg Bid)
    $31 Avg Bid
    3 bids

    I want to build a recommendation system for online portal..So want to discuss some suggestion about algorithm and implementation..I am not looking for API developers who uses Spark MLlib api's....But want to disucss something indepth of statisistcs and mathematical modeling problem..Ideal would be data scientist

    $41 / hr (Avg Bid)
    $41 / hr Avg Bid
    14 bids

    Coding Machine Learning techniques like K Means, KNN, etc in Spark-Scala without using MLlib, i.e., coding the algorithm step by step without using inbuilt packages (or minimal usage) in Spark-Scala

    $105 (Avg Bid)
    $105 Avg Bid
    2 bids

    Spark with Mllib needed And hadoop (hive) required

    $163 (Avg Bid)
    $163 Avg Bid
    2 bids

    Around 80 lines of code in Excel VBA to be translated to Pyspark or Scala

    $26 (Avg Bid)
    $26 Avg Bid
    3 bids

    Spark with Mllib needed And hadoop (hive) required

    $587 (Avg Bid)
    $587 Avg Bid
    9 bids

    I would like a Spark program that implements Extreme Learning machines. I think that Spark would be a good framework for implementing the algorithm due to the fact that the neurons do not require tuning. There will be pieces you can use from Mllib but there will some parts that will have to be written from scratch. Here are some resources to help get you started: - - -

    $370 (Avg Bid)
    $370 Avg Bid
    2 bids

    Requerimos contratar desarrolladores para proyectos varios en español: Profesión: Ingeniero en Sistemas o afines - Conocimientos de Lenguaje SQL. - Conocimiento de herramientas ETL. - Conocimiento de Synapse (Pipelines, DataFactory) - Manejo de Storage Accounts. - Conocimiento de procesos de ingenieria de datos(Databricks) - Conocimiento de Pyspark, Python Experiencia en construcción de warehouse, lakehouse

    $20 / hr (Avg Bid)
    $20 / hr Avg Bid
    18 bids

    saya ingin melakukan read data dan write data dari localhost dengan pyspark di jupyter notebook

    $20 (Avg Bid)
    $20 Avg Bid
    1 bids

    saya ingin melakukan read dan write data localhost dengan menggunakan spark / pyspark di jupyter notebook

    $8 / hr (Avg Bid)
    $8 / hr Avg Bid
    1 bids

    Python, SQL, Spark,Pyspark, BigData

    $11 / hr (Avg Bid)
    $11 / hr Avg Bid
    8 bids

    Necesitamos un Data Engineer con conocimientos de Python/PySpark y Databricks en entorno Azure. Se haría cargo del mantenimiento de una de nuestras aplicaciones durante al menos 2 meses, ampliable. Deseables conocimientos de Datafactory y Retool.

    $38 / hr (Avg Bid)
    $38 / hr Avg Bid
    15 bids

    Entrada: tupla (id,termo) em que "id" é o identificador do documento e "termo" é uma palavra do texto já pré-processada. (Pseudocod/Python/PySpark/Spark)

    $100 (Avg Bid)
    $100 Avg Bid
    2 bids

    Desenvolvimento de algoritmo. sobre MapReduce, utilizando Pyspark/Spark...

    $10 - $30
    $10 - $30
    0 bids

    Pyspark, Python, impala, bigdata

    $1056 (Avg Bid)
    $1056 Avg Bid
    15 bids

    (50 p) Kısım 1 (Apache Spark Uygulaması Geli ¸stirme) 1- Apache Zeppelin notebook kurulmalıdır (Jupyter Notebook’a benzemekle beraber farklılıkları vardır). 2- Daha onceki aktivitilerinizi (KA3 SQL uygulamaları ve bir makine ögrenmesi problemi) Apache Spark SQL ve MLlib kütüphanelerini kullanarak Zeppelin üzerinde gerçekleyiniz. A ¸sagıdaki linklerde çe ¸sitli örnekler mevcuttur. ˘ 3- Zeppelin notebook’unuzu formatında gonderiniz. Çe ¸sitli Zeppelin Notebookları için bakınız: Faydalı videolar:

    $10 - $40
    $10 - $40
    0 bids

    Analiza sentymentu Tweetow przy uzyciu SPARK (PySpark) oraz graficzna reprezentacja analizy. Wiecej szczegolow prywatnie.

    $237 (Avg Bid)
    $237 Avg Bid
    2 bids

    ...Azure Functions, git, VSTS, C#, SQL, NoSQL (documentDB) sont obligatoire ; Python, Azure Event Hub, SSIS, Docker, sont des plus. TJM : 600€ Mission : La DSI France a pour mission d’industrialiser puis opérer un POC data science autours des Restaurants d’Entreprise. L’industrialisation a démarré il y a 6 mois, le code des notebooks à quasiment totalement été porté des notebooks vers des scripts PySpark respectant les bonnes pratiques (unit test, isolation and dependency management). Néanmoins le système de run est à mettre en place, pour ce faire il faudra : • Consolider la Continuous Integration / Delivery • Mettre en place un dashboard de monitoring de la production • G...

    $555 (Avg Bid)
    $555 Avg Bid
    3 bids