Document AI implementation references

deeplearning4j/deeplearning4j

Based on 2 comments

C++

When implementing AI algorithms or neural network operations, document the sources of specific implementation choices, especially for parameters or techniques that might appear arbitrary or unusual at first glance. Include references to established AI frameworks, research papers, or model implementations that informed your approach.

This practice is particularly important for:

- Magic numbers in neural network operations (like attention masks)
- Mathematical formulas or equations adapted from specific libraries
- Parameter choices that deviate from common defaults

Example:

// Apply the mask to the attention weights.
// Masked positions (mask == 0) receive an additive bias of -1e9, which drives
// their softmax output to effectively zero. The magnitude 1e9 is consistent
// with the tensor2tensor implementation.
// Note: BERT uses 1e4, GPT-2 uses 1e10.
*weights += (*reshapedMask - 1) * 1e9;
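
The one-line example above can be expanded into a self-contained sketch that gives the constant a name and keeps the reference next to it. This is only an illustration under stated assumptions: it uses plain std::vector<float> instead of the project's array types, and the names kMaskBiasMagnitude, applyAttentionMask, and softmax are hypothetical, not taken from the repository.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Magnitude of the additive bias for masked attention positions.
// Reference (from the example above): tensor2tensor uses 1e9;
// BERT uses 1e4 and GPT-2 uses 1e10.
// Assumption: scores are float, where 1e9 is finite and exp(-1e9) underflows to 0.
constexpr float kMaskBiasMagnitude = 1e9f;

// Hypothetical helper: adds (mask - 1) * kMaskBiasMagnitude to every score.
// mask[i] == 1 keeps a position unchanged, mask[i] == 0 pushes it to about -1e9.
inline void applyAttentionMask(std::vector<float>& scores,
                               const std::vector<float>& mask) {
    assert(scores.size() == mask.size());
    for (std::size_t i = 0; i < scores.size(); ++i) {
        scores[i] += (mask[i] - 1.0f) * kMaskBiasMagnitude;
    }
}

// Numerically stable softmax: subtracts the maximum before exponentiating.
inline std::vector<float> softmax(const std::vector<float>& x) {
    assert(!x.empty());
    const float maxVal = *std::max_element(x.begin(), x.end());
    std::vector<float> out(x.size());
    float sum = 0.0f;
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = std::exp(x[i] - maxVal);
        sum += out[i];
    }
    for (float& v : out) {
        v /= sum;
    }
    return out;
}
```

Calling applyAttentionMask and then softmax on a row of scores leaves the unmasked positions sharing the full probability mass. Because the constant has a name and a cited origin, a reader who needs to match a different reference model can change it deliberately.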

This documentation helps future developers understand the rationale behind implementation decisions, facilitates accurate debugging, and enables informed modifications when updating the code. It also preserves knowledge about AI model compatibility that might otherwise be lost over time.
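
The same habit applies to formulas adapted from a paper or library, the second case in the list above. As an illustration that does not come from the source discussion, the sketch below documents the constants of the widely used tanh approximation of GELU; the names kSqrt2OverPi, kGeluCubicCoeff, and geluTanh are hypothetical.

```cpp
#include <cmath>

// Tanh approximation of GELU, as given in Hendrycks & Gimpel,
// "Gaussian Error Linear Units (GELUs)":
//   gelu(x) ~= 0.5 * x * (1 + tanh(sqrt(2 / pi) * (x + 0.044715 * x^3)))
// Both constants come from that formula; they are not tuned values.
constexpr double kSqrt2OverPi = 0.7978845608028654;  // sqrt(2 / pi)
constexpr double kGeluCubicCoeff = 0.044715;         // cubic-term coefficient

// Hypothetical helper that evaluates the approximation for a single value.
inline double geluTanh(double x) {
    const double inner = kSqrt2OverPi * (x + kGeluCubicCoeff * x * x * x);
    return 0.5 * x * (1.0 + std::tanh(inner));
}
```

Without the comment, 0.044715 looks arbitrary. With it, a reviewer can check the value against the cited formula instead of guessing where it came from.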
