Optimize AI implementation patterns

ollama/ollama

Based on 3 comments

Reviewer Prompt

When implementing AI systems, prefer established patterns and existing optimizations over new implementations of problems that are already solved. This applies across AI development:

For model operations, reuse an existing function rather than adding a new one that serves the same purpose. For example, use ggml_scale(ctx, cur, -1) rather than adding a separate ggml_neg operation: it produces the same result with equivalent performance and memory usage.
When designing data structures for AI models (such as vocabulary handling), account for every variation and edge case. For instance, a model may define more than one end-of-generation token (EOS, EOT), so do not assume there is only one.
Keep documentation of AI tools precise so that it stays clear to users at every experience level.
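The first point can be illustrated without ggml itself. The following is a minimal sketch, assuming a toy backend whose only elementwise primitive is a scalar multiply; `Tensor`, `scale`, and `negate` are hypothetical names for illustration and are not ggml types or functions.

```cpp
#include <vector>

// Hypothetical stand-in for a tensor; not a ggml type.
using Tensor = std::vector<float>;

// The primitive the toy backend already provides: multiply every element by s.
Tensor scale(const Tensor & a, float s) {
    Tensor out;
    out.reserve(a.size());
    for (float v : a) {
        out.push_back(v * s);
    }
    return out;
}

// Negation expressed through the existing primitive, so there is no
// second elementwise loop to write, test, and keep in sync.
Tensor negate(const Tensor & a) {
    return scale(a, -1.0f);
}
```

Calling `negate` on a tensor holding 1.0 and -2.5 yields -1.0 and 2.5. The design point is that `negate` adds no new kernel, only a name for an existing one.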

// Instead of adding a new operation:
// ggml_tensor * result = ggml_neg(ctx, tensor);

// Prefer reusing an existing one (the scale factor is a float):
ggml_tensor * result = ggml_scale(ctx, tensor, -1.0f);

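// Instead of a single end-of-generation token:
// struct vocab {
//     uint32_t eog_token; // assumes exactly one end token
// };

// Prefer a collection (requires <cstdint> and <vector>):
struct vocab {
    std::vector<uint32_t> eog_tokens; // all end-of-generation token ids (e.g. EOS, EOT)
};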
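To show how the multi-token layout is used in practice, here is a minimal sketch of an end-of-generation check. The `is_eog` member and the `generated_length` helper are assumptions made for illustration, not the project's actual API, and the token ids in the usage note are arbitrary.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical vocabulary type for illustration; not the project's actual struct.
struct vocab {
    std::vector<uint32_t> eog_tokens; // every id that ends generation (e.g. EOS, EOT)

    // True if `token` is any of the registered end-of-generation ids.
    bool is_eog(uint32_t token) const {
        return std::find(eog_tokens.begin(), eog_tokens.end(), token) != eog_tokens.end();
    }
};

// Counts the tokens that precede the first end-of-generation token,
// or returns the full length if none appears.
std::size_t generated_length(const vocab & v, const std::vector<uint32_t> & tokens) {
    std::size_t n = 0;
    for (uint32_t t : tokens) {
        if (v.is_eog(t)) {
            break;
        }
        ++n;
    }
    return n;
}
```

With `eog_tokens` set to ids 2 and 7, a stream of 5, 9, 7, 4 stops after two tokens. A single-field design that only knew about id 2 would have run past the 7.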

These practices make code easier to maintain and leave fewer places for bugs to hide. They can also help performance, which matters in AI systems that already have high computational demands.
