Codersarts - Quantisation: Run Large LLMs on Consumer GPUs with 4-bit