19:["$","section",null,{"className":"related-section","children":[["$","h2",null,{"className":"related-section__h","children":"Other angles on this story"}],["$","ul",null,{"className":"related-list","children":[["$","li","a9a3f0bf-6da4-415a-bc9f-a86f87ae01e3",{"className":"related-item","children":[["$","span",null,{"className":"related-item__num","children":"01"}],["$","$L13",null,{"href":"/en/article/plus-petitsvg-aria-hiddentrue-data-componentocticon-height16-viewbox0-0-16-16-ve-a9a3f0","className":"related-item__title","children":" openai / whisper"}],["$","span",null,{"className":"related-item__score","children":["SIG ",85]}]]}],["$","li","63c5c09e-655f-4751-8850-1b529bbb57f0",{"className":"related-item","children":[["$","span",null,{"className":"related-item__num","children":"02"}],["$","$L13",null,{"href":"/en/article/llm-anthropic-0251-63c5c0","className":"related-item__title","children":"llm-anthropic 0.25.1"}],["$","span",null,{"className":"related-item__score","children":["SIG ",85]}]]}],["$","li","4402214c-dcb0-4da9-8f23-90b770bfd2e3",{"className":"related-item","children":[["$","span",null,{"className":"related-item__num","children":"03"}],["$","$L13",null,{"href":"/en/article/plus-petitsvg-aria-hiddentrue-data-componentocticon-height16-viewbox0-0-16-16-ve-440221","className":"related-item__title","children":" facebookresearch / sam3"}],["$","span",null,{"className":"related-item__score","children":["SIG ",85]}]]}],["$","li","380489ed-d12b-40e8-8c63-74ffaf01fbad",{"className":"related-item","children":[["$","span",null,{"className":"related-item__num","children":"04"}],["$","$L13",null,{"href":"/en/article/plus-petitsvg-aria-hiddentrue-data-componentocticon-height16-viewbox0-0-16-16-ve-380489","className":"related-item__title","children":" openai / whisper"}],"$L1c"]}],"$L1d"]}]]}]

llama: use f16 mask for FA to save VRAM by am17an · Pull Request #23764 · ggml-org/llama.cpp

Other angles on this story