TinyLlama.cpp 1.0
A lightweight C++ implementation of the TinyLlama language model
Loading...
Searching...
No Matches
Public Attributes | List of all members
KVCacheLayer Struct Reference

Key-Value cache for a single transformer layer. More...

#include <model.h>

Collaboration diagram for KVCacheLayer:
Collaboration graph

Public Attributes

std::vector< float > k
 
std::vector< float > v
 

Detailed Description

Key-Value cache for a single transformer layer.

Stores the key and value tensors for attention mechanism, with optional CUDA support for GPU acceleration.

Definition at line 130 of file model.h.

Member Data Documentation

◆ k

std::vector<float> KVCacheLayer::k

◆ v

std::vector<float> KVCacheLayer::v

The documentation for this struct was generated from the following file: