-
Notifications
You must be signed in to change notification settings - Fork 9
Amangu/dev #40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: sanbao_trace
Are you sure you want to change the base?
Amangu/dev #40
Conversation
…mm_batched_simple benchmark.
…mm_batched_simple benchmark. [Part-2]
|
|
||
| def gemm_simple( | ||
| m: int, k: int, n: int, num_runs: int = 1, trace_dir: str = None | ||
| m: int, k: int, n: int, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I prefer not to change my implementation on gemm_simple now as it required changes on yaml and other part. Let us focus on delivering results now. Please remove these changed on gemm_simple. Thanks.
| ) -> Dict[str, Any]: | ||
| # Calculate FLOPs | ||
| total_flops = 2 * m * k * n # Total floating-point operations | ||
| total_flops = (2 * k + 1) * m * n # Total floating-point operations |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same as gemm_simple.
…rics to get correct ragged_dot wall_time. Added a try catch to continue the sweap even if some configs failed due to lack of resources.
…gh it won't be used in the calculations.
Adding couple of benchmarks:
Modified some benchmarks to support multiple input and output dataTypes