I need to trigger Spark Jobs to aggregate data from a JSON file using an API call. I use spring-boot to create the resources. Thus, the steps for the solution is the following:
- User makes an POST request with a json file as the input
- The JSON file is stored in google bucket associated with dataproc cluster.
- A aggregating spark job is triggered from within the REST method with the specified jars, classes and the argument is the json file link.
I want the job to be triggered using Dataproc's Java Client instead of console or command line. How do you do it?
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…