dataflow-cloud-profiler

How to integrate Google Cloud Profiler with Dataflow jobs

By passing the --dataflowServiceOptions option

1.1 To enable CPU profiling, start the pipeline with the following option.

--dataflowServiceOptions=enable_google_cloud_profiler

1.2 To enable heap profiling, start the pipeline with the following options. Heap profiling requires Java 11 or higher.

--dataflowServiceOptions=enable_google_cloud_profiler
--dataflowServiceOptions=enable_google_cloud_heap_sampling

Example with mvn build.

mvn compile -e exec:java \
  -Dexec.mainClass=$MAIN \
  -Dexec.args="--project=$PROJECT \
      --stagingLocation=gs://$BUCKET/staging/ $* \
      --tempLocation=gs://$BUCKET/staging/ \
      --runner=DataflowRunner \
      --numWorkers=2 \
      --dataflowServiceOptions=enable_google_cloud_profiler \
      --dataflowServiceOptions=enable_google_cloud_heap_sampling"
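
The two profiling modes above differ only in which service options are passed, so a launch script could select them with a small helper. The following is a minimal sketch, not part of this repository: the function name profiler_flags and its "cpu"/"heap" argument are hypothetical, and only the two option values are taken from the text above.

```shell
# Hypothetical helper: prints the Dataflow service-option flags for the
# requested profiling mode ("cpu" or "heap", default "cpu").
profiler_flags() {
  mode="${1:-cpu}"
  case "$mode" in
    cpu)
      # CPU profiling needs only the base profiler option.
      echo "--dataflowServiceOptions=enable_google_cloud_profiler"
      ;;
    heap)
      # Heap profiling passes the base option plus the heap sampling option.
      echo "--dataflowServiceOptions=enable_google_cloud_profiler --dataflowServiceOptions=enable_google_cloud_heap_sampling"
      ;;
    *)
      echo "unknown profiling mode: $mode" >&2
      return 1
      ;;
  esac
}

# Possible usage inside a launch script (other arguments omitted):
#   -Dexec.args="--runner=DataflowRunner $(profiler_flags heap)"
```

Keeping the flags in one place makes it harder to request heap sampling without also enabling the profiler itself.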

Programmatically, by creating an agent configuration object with the settings below.

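 // Requires: import org.apache.beam.runners.dataflow.options.DataflowProfilingOptions;
 DataflowProfilingOptions profilingOptions = options.as(DataflowProfilingOptions.class);
 // The agent configuration is a nested class of DataflowProfilingOptions.
 DataflowProfilingOptions.DataflowProfilingAgentConfiguration agent =
     new DataflowProfilingOptions.DataflowProfilingAgentConfiguration();
 agent.put("APICurated", true);
 profilingOptions.setProfilingAgentConfiguration(agent);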

Types of profiling available

Cloud Profiler supports different types of profiling based on the language in which a program is written. The following table summarizes the supported profile types by language:

Profile type     Go   Java   Node.js   Python
CPU time         Y    Y      -         Y
Heap             Y    Y      Y         -
Allocated heap   Y    -      -         -
Contention       Y    -      -         -
Threads          Y    -      -         -
Wall time        -    Y      Y         Y

Y = supported, - = not supported.

For complete information on the language requirements and any restrictions, see the language's how-to page. For more information about these profile types, see Profiling concepts.

To Run the Demo

Create a Pub/Sub topic named sensor_events.
Run the Python script to generate the sample data: python send_sensor_xml_data.py --project PROJECT_ID --speedFactor 60
Run the demo example: ./run_oncloud.sh
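
The demo steps can be collected into one script. This is a rough sketch under stated assumptions rather than a file from this repository: the run and demo_steps helpers, the DRY_RUN switch, and the my-project placeholder are hypothetical, the launch script is assumed to be named run_oncloud.sh, and the topic is created with the gcloud CLI, which the steps above do not prescribe. With DRY_RUN=1 (the default here) the commands are only printed.

```shell
# Placeholder project ID; override by exporting PROJECT_ID before running.
PROJECT_ID="${PROJECT_ID:-my-project}"
# DRY_RUN=1 prints each command; set DRY_RUN=0 to execute them.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

demo_steps() {
  # 1. Create the Pub/Sub topic used by the demo.
  run gcloud pubsub topics create sensor_events --project "$PROJECT_ID"
  # 2. Generate the sample sensor data.
  run python send_sensor_xml_data.py --project "$PROJECT_ID" --speedFactor 60
  # 3. Launch the demo pipeline (script name is an assumption).
  run ./run_oncloud.sh
}
```

Calling demo_steps with the defaults prints the three commands in order, which is a cheap way to check the project ID before anything is created.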

mokhahmed/dataflow-cloud-profiler
