|
TinyLlama.cpp 1.0
A lightweight C++ implementation of the TinyLlama language model
|
This is the complete list of members for KVCache, including all inherited members.
| batch_seq_lens | KVCache | |
| clear_data() | KVCache | inline |
| current_batch_size | KVCache | |
| destroy_gpu_resources() | KVCache | |
| initialize(const ModelConfig &config, int total_num_model_layers, int num_gpu_layers_to_allocate, int max_seq_len_arg, int num_kv_heads, int head_dim, int max_batch_size_arg=1) | KVCache | |
| initialize_batch(int batch_size) | KVCache | inline |
| layers | KVCache | |
| max_batch_size | KVCache | |
| max_seq_len_config_ | KVCache | |
| seq_len | KVCache | |
| total_model_layers_ | KVCache | |
| ~KVCache() | KVCache | inline |