Today I spent time studying the DeepSeek-OCR model, and my learning process could be divided into three main steps.
First, I read several blogs and technical posts to understand its overall architecture and workflow.

Second, I began reproducing the end-to-end model, but in this stage I still need to set proper breakpoints to clearly see what happens inside each part of the model.

Third, because of the viral warts around my eyes, I had to spend about two or three hours getting them removed by laser treatment, which cost me some of my working time.

Even so, while reading those blogs, I came up with a few innovative ideas—for example, I think it might be possible to compress LaTeX layouts, so that we wouldn’t need SAM anymore since its function could be integrated into the compression model. I’m not entirely sure whether this idea is right, but it seems worth exploring further.

As for my graduation paper, I realized that I still have a lot more work to do, and I need to dedicate more effort to it. That’s all for today. I’m quite tired tonight, but I still have to spend at least an hour preparing for the civil service test.

Good night, my friend.