Neural Magic empowers enterprises to harness the power of open-source Large Language Models (LLMs). We develop cutting-edge AI inference solutions that optimize the performance of LLMs across various infrastructures, maximizing computational efficiency and minimizing costs. Our expertise lies in model quantization and sparsification, allowing organizations to deploy AI models securely and effectively. We're a Series A startup backed by leading investors like Andreessen Horowitz and NEA, working at the forefront of AI innovation. Our engineering team leverages a robust tech stack including Python, PyTorch, TensorFlow, and various deep learning libraries, along with infrastructure tools like Amazon, nginx, and Cloudflare. We foster a collaborative and innovative environment, encouraging creative problem-solving and a strong sense of ownership. We prioritize employee growth through continuous training and development opportunities and offer a flexible work environment, including remote work options. We're committed to building a diverse and inclusive workplace where every team member can thrive. Our mission is to make the power of open-source LLMs accessible to every enterprise. We strive to simplify complex Generative AI deployments, accelerating innovation and reducing barriers to entry for businesses of all sizes. This means constantly pushing boundaries and working on challenging technical problems at the cutting edge of deep learning. Join us and help shape the future of AI! Neural Magic has experienced significant growth since its founding in 2018, securing substantial Series A funding to fuel continued expansion and innovation. We're currently a team of 51-100 employees based primarily in Somerville, Massachusetts, with a distributed remote workforce. We offer a competitive benefits package inclusive of comprehensive health insurance, retirement plans, paid time off, and stock options.
📍 United States
🧭 Full-Time
🔍 Artificial Intelligence
PythonKerasMachine LearningNumpyPyTorchProduct DevelopmentAlgorithmsTensorflow