@JesseStutler
Member

  • Please check if the PR fulfills these requirements
  • The commit message follows our guidelines
  • What kind of change does this PR introduce? (Bug fix, feature, docs update, ...)
    /kind documentation

  • What this PR does / why we need it:
    The home page and introduction docs are out of date and need to be updated.

  • Which issue(s) this PR fixes:
    Related to [Docs] Complete official website documentation #330

@volcano-sh-bot added the retest-not-required-docs-only and kind/documentation (Categorizes issue or PR as related to documentation.) labels on Jan 9, 2025
@volcano-sh-bot added the size/L (Denotes a PR that changes 100-499 lines, ignoring generated files.) label on Jan 9, 2025
@JesseStutler
Member Author

cc @Monokaix

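* Mixed scheduling of heterogeneous devices

### Network Topology-Aware Scheduling
* Supports network topology-aware scheduling, optimizing communication efficiency between applications and improving the performance of distributed applications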
Member

The wording needs to be reorganized: highlight the optimization for AI scenarios and the improvement in AI training efficiency.

Member Author

done

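### Multi-Cluster Scheduling
* Supports cross-cluster job scheduling, extending VolcanoJob capabilities to multiple clusters to manage larger-scale resource pools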

* x86
Member

These can be kept.

Member Author

Restored.

[[item]]
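title = "High-Performance Scheduling"
content = "Converts domain-specific jobs into Kubernetes workloads and schedules them with excellent performance"
title = "Unified Scheduling"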
Member

High-performance unified scheduling?

Member Author

updated

[[item]]
title = "Multiple Scheduling Policies"
content = "Co-scheduling, Fair-Share, Gang scheduling, Topologies, Reserve/BackFill, Data-aware Scheduling, and more"
content = "Supports multiple scheduling policies such as Gang, Binpack, DeviceShare, Capacity, and Proportion to optimize resource utilization"
Member

Add NUMA topology as well, and keep Fair-Share.

Member Author

done

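title = "Multi-Runtime Support"
content = "Singularity and GPU accelerators"
title = "Colocation of Online and Offline Workloads"
content = "Supports colocation of online and offline workloads, improving cluster resource utilization through intelligent scheduling strategies"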
Member

title: Online and Offline Colocation
content: Supports colocating online and offline workloads; with unified scheduling, dynamic resource oversubscription, CPU Burst, resource isolation, and related capabilities, it improves resource utilization while guaranteeing QoS for online workloads.

Member Author

done

overlay_filter = 0.5

[[item]]
title = "Rich Monitoring Capabilities"
Member

This could be called observability instead; add the dashboard here as well.

Member Author

done

* [Cromwell](https://cromwell.readthedocs.io/)

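In addition, Volcano has been adopted commercially by multiple companies and organizations as their infrastructure scheduling engine.
In addition, Volcano is used commercially by multiple enterprises and organizations as their core scheduling engine, significantly improving the management efficiency of large-scale clusters.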
Member

"Significantly improving the management efficiency of large-scale clusters"? Highlight resource management, Job management, scheduling performance, and scheduling policies for AI and big data scenarios.

Member Author

done

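• Heterogeneous device support: efficiently schedules heterogeneous devices such as GPUs and NPUs, fully unlocking the hardware's computing potential.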

***
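• Network topology awareness: optimizes communication efficiency between distributed applications, improving overall performance.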
Member

Highlight the training efficiency improvement in AI scenarios.

Member Author

done

@JesseStutler
Member Author

cc @Monokaix @william-wang


> For more details about multi-cluster scheduling, see: [volcano-global](https://github.com/volcano-sh/volcano-global)
### Rescheduling
Member

-> Descheduling

Member Author

done

cta_icon = "graduation-cap"

[[item]]
title = "Rescheduling"
Member

Same as above.

Member Author

done


• Multi-cluster Scheduling: Supports cross-cluster job scheduling, improving resource pool management capabilities and achieving large-scale load balancing.

• Online-Offline Workloads Colocation: Enables online and offline workloads colocation, improving cluster resource utilization through intelligent scheduling strategies.
Member

Online-Offline -> Online and Offline

Member Author

done

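experience, combined with the best ideas and practices from the open source community.
• Multi-cluster scheduling: supports cross-cluster job scheduling, improving resource pool management and achieving large-scale load balancing.

• Online and offline colocation: enables mixed deployment of online and offline tasks, improving cluster resource utilization.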
Member

Add an item for load-aware descheduling.

Member Author

done

name = "KubeGene"
url = "https://github.com/kubegene/kubegene "
description = "KubeGene is dedicated to making genome sequencing simple, portable, and scalable."
img_src = "ray_logo.png"
Member

Ray can be moved closer to the front.

Member Author

Swapped with PaddlePaddle.

@JesseStutler
Member Author

Please cc @Monokaix @william-wang @kevin-wangzefeng

@Monokaix
Member

/lgtm

@volcano-sh-bot added the lgtm (Indicates that a PR is ready to be merged.) label on Jan 21, 2025
Member

@kevin-wangzefeng left a comment

Can we change the layout of the "About Volcano" part?
Also, the logo used is out of date.

Member

@kevin-wangzefeng left a comment

/approve
The comments are not blocking, thanks

@volcano-sh-bot
Collaborator

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kevin-wangzefeng

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@volcano-sh-bot added the approved (Indicates a PR has been approved by an approver from all required OWNERS files.) label on Jan 21, 2025
@volcano-sh-bot merged commit 21fc4df into volcano-sh:master on Jan 21, 2025
7 checks passed
