GPU-accelerated Llama3.java inference in pure Java using TornadoVM (github.com)
47 points by pjmlp 5 days ago