fosai · Free Open-Source Artificial Intelligence
by tinwhiskers, originally posted on lemmy.world

New technique to run 70B LLM Inference on a single 4GB GPU
https://ai.gopubby.com/unbelievable-run-70b-llm-inference-on-a-single-4gb-gpu-with-this-new-technique-93e2057c7eeb