What is Hy3?
Hy3 preview is the latest and most capable model in Tencent Hy's Hy series. It is a Mixture-of-Experts model with 295 billion total parameters, of which 21 billion are activated, plus 3.8 billion in the MTP layer, and it supports a context window of up to 256,000 tokens. It is built on Tencent Hy's newly upgraded infrastructure, which is designed to improve performance in practical use: complex reasoning, instruction following, in-context learning, coding, and general inference. The model combines fast and deliberate thinking, so it can answer simple questions directly and work through complex math, programming, and logic problems in detail. It targets long-context understanding, accurate instruction following, tool use, and agent workflows, and it is evaluated on realistic business and development scenarios as well as on traditional benchmarks. Its general-purpose design is intended to adapt to a wide range of applications.
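To put the parameter figures in perspective, the short sketch below works out what share of the model is active at once. It uses only the 295 billion total and 21 billion activated figures quoted above; the variable names are illustrative, and it assumes the activated count is a subset of the total, which is the usual convention for Mixture-of-Experts models but is not stated explicitly here.

```python
# Figures quoted in the description, in billions of parameters.
TOTAL_PARAMS_B = 295.0   # total parameters
ACTIVE_PARAMS_B = 21.0   # activated parameters

# Assumption: the activated parameters are counted within the total.
active_share = ACTIVE_PARAMS_B / TOTAL_PARAMS_B
total_to_active = TOTAL_PARAMS_B / ACTIVE_PARAMS_B

print(f"active share: {active_share:.1%}")            # active share: 7.1%
print(f"total / active: {total_to_active:.1f}x")      # total / active: 14.0x
```

Under that assumption, roughly one parameter in fourteen takes part in a given forward pass, which is why compute cost tracks the 21 billion figure more closely than the 295 billion one.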