Kezhi's Blog

锦绣太原青年博士行

发表于 2024-05-31 分类于 Journal Waline：本文字数： 3.8k 阅读时长 ≈ 3 分钟

24年5月28日 - 24年5月31日，我和果果一起参加了青年博士太原行。总的来说，这次太原之行给我最大的感受就是惊喜。

The magnificent Taiyuan young doctors journey.

发表于 2024-05-31 分类于 Journal Waline：本文字数： 11k 阅读时长 ≈ 10 分钟

This is an automatically translated post by LLM. The original post is in Chinese. If you find any translation errors, please leave a comment to help me improve the translation. Thanks!

24th May 28th - 31st, 24th, I participated in the Youth Doctor Taiyuan Tour with Guoguo. Overall, the trip to Taiyuan gave me the biggest surprise.

阅读全文 »

浅谈信任

发表于 2024-03-25 分类于 Journal Waline：本文字数： 730 阅读时长 ≈ 1 分钟

不得不说，信任是十分珍贵且稀少的一样东西。其珍贵和稀少的原因很简单——怀疑是很容易的。

阅读全文 »

Talking About Trust.

发表于 2024-03-25 更新于 2024-04-20 分类于 Journal Waline：本文字数： 2.3k 阅读时长 ≈ 2 分钟

This is an automatically translated post by LLM. The original post is in Chinese. If you find any translation errors, please leave a comment to help me improve the translation. Thanks!

It has to be said that trust is something extremely precious and rare. The reason for its preciousness and rarity is simple - doubt is easy.

阅读全文 »

炼丹工业化——mlflow使用简介

发表于 2024-03-21 分类于 Note Waline：本文字数： 4.7k 阅读时长 ≈ 4 分钟

丹药管理是炼丹术中很重要的部分。毕竟，当一批一批的丹药通过不同的材料和火候炼制出来之后，我们总是会产生如下需求：

对丹药效果进行评估，筛选出好的丹药
将一批丹药的效果进行对比，从而发现好的材料配比（超参数）
发现了一颗效果好的丹药，想要复现这颗丹药的生产过程
......

为了实现以上需求，我们便需要炼丹管理技术，简单来说，就是要在 代码，超参数，模型之间建立映射关系并对齐进行存储，以便支持后续的分析以及训练过程的回溯。由此，诞生了很多机器学习全生命周期管理软件，本文主要介绍一款开源软件：MLflow 的简单使用流程

阅读全文 »

Alchemy Industrialization - Introduction to mlflow.

发表于 2024-03-21 更新于 2024-04-20 分类于 Note Waline：本文字数： 7.5k 阅读时长 ≈ 7 分钟

This is an automatically translated post by LLM. The original post is in Chinese. If you find any translation errors, please leave a comment to help me improve the translation. Thanks!

Dan medicine management is a crucial part of alchemy. After all, when batches of pills are produced through different materials and firing processes, we always have the following needs:

Evaluate the effects of the pills and select the good ones
Compare the effects of batches of pills to discover good material ratios (hyperparameters)
Reproduce the production process of a pill with good effects
......

To meet these needs, we need alchemy management technology. In simple terms, we need to establish mappings and alignments between code, hyperparameters, and models for storage, in order to support subsequent analysis and training process tracing. As a result, many machine learning lifecycle management software have been developed. This article mainly introduces an open-source software called MLflow and its simple usage process.

阅读全文 »

第一篇论文完成后

发表于 2024-03-07 分类于 Research Waline：本文字数： 1.7k 阅读时长 ≈ 2 分钟

前段时间，我完成了第一篇论文，经历了从选题 -> 实验 -> 结果分析 -> 论文的全过程。在此期间虽然走了一些弯路，但也对科研发文的方法论有了新的认识。本文将围绕该次发文过程，概述科研大体流程，并对其中的一些细节进行思考。为今后的科研发文以及想要发文章的朋友们提供参考。

致谢：感谢在实验和论文撰写过程中给予我指导的老师以及与我探讨的朋友们。

Generated by ChatGPT-4

阅读全文 »

After the completion of the first paper.

发表于 2024-03-07 更新于 2024-04-20 分类于 Research Waline：本文字数： 5.3k 阅读时长 ≈ 5 分钟

This is an automatically translated post by LLM. The original post is in Chinese. If you find any translation errors, please leave a comment to help me improve the translation. Thanks!

Recently, I completed my first research paper, going through the entire process from topic selection to experiments, result analysis, and paper writing. Although I took some detours along the way, I gained new insights into the methodology of scientific research and writing. This article will outline the general process of scientific research and discuss some of the details, providing reference for future research and for those who want to publish articles.

Acknowledgments: Thanks to the teachers who guided me during the experiments and paper writing, as well as friends who discussed with me.

Generated by ChatGPT-4

阅读全文 »

Policy Optimization

发表于 2023-12-12 更新于 2023-12-18 分类于 Research Waline：本文字数： 13k 阅读时长 ≈ 12 分钟

本文主要对基于策略优化定理的强化学习算法及相关变体进行一个梳理和总结。

阅读全文 »

Policy Optimization

发表于 2023-12-12 更新于 2024-04-20 分类于 Research Waline：本文字数： 6k 阅读时长 ≈ 5 分钟

This is an automatically translated post by LLM. The original post is in Chinese. If you find any translation errors, please leave a comment to help me improve the translation. Thanks!

This article mainly reviews and summarizes reinforcement learning algorithms based on policy optimization theorems and related variants.

阅读全文 »