Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Reticulate is a handy way to combine Python and R code. According to the reticulate help page, reticulate allows for: "Calling Python from R in a variety of ways including R Markdown, sourcing ...