Final make this extension, which can automatically switch target language when source language is same with target language.
What surprised me was I only spen half day to program and build this extension, which inspired to do it immediately after have a new idea, this will be the best experience not only for result but also for my satisfaction.
This can save lots of time when using Goole Translate. In addition, I can totally transfer from baidufanyi to Google Translate.
Found the API used in Google Translate extension and use it in my extension, maybe it’s not a nice behavior.
It works well in for five days, hope it works for a long time.
I want to do more about Triton, here is all what I can think
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
* rewrite or improve current kernel
* flash attention
* cut cross entropy
* CE
* analysis time and memory cost of kernel
* compare different model and framework [H]
* Unsloth
* Deepseed
* TensorRT-LLM
* vLLM
* Liger-Kernel
* Flash-attention
* xformers
* DeepSeek
* MoE
* electricity
* MLA
* MTP
* fp8
* dimensions
* fine-tune
* inference
* improve and test
* architecture ability
* cuda
* llama.cpp
As a result, I will try Pytroch profiler at first, there will be more thing in it I can do.
In addition, Found a good post about matrix multiplication, try to do it by Triton.
1) Do it as soon as possible after having a new idea.
2) I always travel in a loop between hope and disappointment, maybe this is the life.
This post is licensed under CC BY 4.0 by the author.
Final make this extension, which can automatically switch target language when source language is same with target language.
What surprised me was I only spen half day to program and build this extension, which inspired to do it immediately after have a new idea, this will be the best experience not only for result but also for my satisfaction.
This can save lots of time when using Goole Translate. In addition, I can totally transfer from baidufanyi to Google Translate.
Found the API used in Google Translate extension and use it in my extension, maybe it’s not a nice behavior.
It works well in for five days, hope it works for a long time.
I want to do more about Triton, here is all what I can think
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
* rewrite or improve current kernel
* flash attention
* cut cross entropy
* CE
* analysis time and memory cost of kernel
* compare different model and framework [H]
* Unsloth
* Deepseed
* TensorRT-LLM
* vLLM
* Liger-Kernel
* Flash-attention
* xformers
* DeepSeek
* MoE
* electricity
* MLA
* MTP
* fp8
* dimensions
* fine-tune
* inference
* improve and test
* architecture ability
* cuda
* llama.cpp
As a result, I will try Pytroch profiler at first, there will be more thing in it I can do.
In addition, Found a good post about matrix multiplication, try to do it by Triton.
1) Do it as soon as possible after having a new idea.
2) I always travel in a loop between hope and disappointment, maybe this is the life.
This post is licensed under CC BY 4.0 by the author.