Collecting Spark metrics (WIP)

Posted at 2015-05-31

Goals

  • Collect metrics from Spark
  • Monitor the performance of each node

Data sources

  • Spark metrics
  • Ganglia metrics
  • Performance information for each node
    • cpu, load average, memory usage, disk i/o, network i/o...

Spark metrics

Official documentation (1.3.1)

  • http://<driver-node>:4040
  • To view the web UI after the fact, set spark.eventLog.enabled to true before starting the application. This configures Spark to log Spark events that encode the information displayed in the UI to persisted storage.
  • If Spark is run on Mesos or YARN, it is still possible to reconstruct the UI of a finished application through Spark’s history server, provided that the application’s event logs exist.
    • ./sbin/start-history-server.sh
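
The event-log settings quoted above can be sketched as configuration. This is a minimal sketch, not a verified setup: the log directory `hdfs:///spark-events` and the helper name `event_log_conf` are assumptions that do not appear in these notes. The property names `spark.eventLog.enabled`, `spark.eventLog.dir` and `spark.history.fs.logDirectory` are the ones listed in the Spark configuration and monitoring docs. The script only renders lines that could be appended to `conf/spark-defaults.conf`.

```python
def event_log_conf(log_dir):
    """Return spark-defaults.conf lines that enable event logging.

    log_dir is a hypothetical location; any storage that both the
    application and the history server can reach should work.
    """
    settings = [
        # Persist UI events so the UI can be rebuilt after the app ends.
        ("spark.eventLog.enabled", "true"),
        # Where the application writes its event logs.
        ("spark.eventLog.dir", log_dir),
        # Where the history server looks for those logs.
        ("spark.history.fs.logDirectory", log_dir),
    ]
    # spark-defaults.conf uses "key<whitespace>value" lines.
    return "\n".join(f"{key} {value}" for key, value in settings)


if __name__ == "__main__":
    print(event_log_conf("hdfs:///spark-events"))
```

With these lines in place before the application starts, `./sbin/start-history-server.sh` should be able to rebuild the UI of finished applications from the same directory.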

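Open questions

  • It is not yet clear which metric values can actually be collected
  • A GraphiteSink is provided, so metrics can apparently be written directly to Graphite
  • Ganglia metrics also seem to be available, but apparently Spark has to be custom-built for licensing reasons (GangliaSink pulls in LGPL-licensed code)
    • As far as I recall, Spark on EMR can be provisioned with Ganglia enabled, so check whether it already uses such a custom build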
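
Since GraphiteSink ships with Spark, pointing it at Graphite should only need a `conf/metrics.properties` file. The following is a minimal sketch that renders one, not a tested configuration. The host `graphite.example.com`, the port 2003 (Carbon's usual plaintext port) and the helper name `graphite_sink_properties` are assumptions for illustration. The property names follow Spark's `metrics.properties.template`.

```python
def graphite_sink_properties(host, port=2003, period=10, unit="seconds", prefix=""):
    """Return metrics.properties lines that enable Spark's GraphiteSink.

    host, port and prefix are hypothetical values to adapt per environment.
    """
    lines = [
        # "*" applies the sink to every instance (master, worker, driver, executor).
        "*.sink.graphite.class=org.apache.spark.metrics.sink.GraphiteSink",
        f"*.sink.graphite.host={host}",
        f"*.sink.graphite.port={port}",
        # How often metrics are reported, in the given unit.
        f"*.sink.graphite.period={period}",
        f"*.sink.graphite.unit={unit}",
    ]
    if prefix:
        # Optional string prepended to every metric name sent to Graphite.
        lines.append(f"*.sink.graphite.prefix={prefix}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(graphite_sink_properties("graphite.example.com", prefix="spark"))
```

Spark reads this file from `$SPARK_HOME/conf/metrics.properties` by default, or from the path given in `spark.metrics.conf`.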

Ganglia metrics

Looking at install-ganglia-metrics in the EMR bootstrap actions, it is set up to use GangliaSink.scala. Below is the source code I traced.
