Updated on 2024-08-29 GMT+08:00

Custom Models

If the model you want to use is neither a Pangu model nor an open-source model compatible with the OpenAI API (for example, a closed-source model, or a custom inference service deployed on bare metal), you can define your own model by extending AbstractLLM. Sample code:
import com.alibaba.fastjson.JSON;
import com.alibaba.fastjson.JSONObject;
import lombok.extern.slf4j.Slf4j;
import org.apache.commons.lang3.StringUtils;
import org.apache.http.HttpResponse;
import org.apache.http.client.methods.CloseableHttpResponse;
import org.apache.http.client.methods.HttpPost;
import org.apache.http.entity.ContentType;
import org.apache.http.entity.StringEntity;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.nio.client.CloseableHttpAsyncClient;
import org.apache.http.message.BasicHeader;
import org.apache.http.nio.client.methods.HttpAsyncMethods;
import org.apache.http.util.EntityUtils;

import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.stream.Collectors;

// SDK types such as AbstractLLM, LLMResp, LLMConfig, ConversationMessage,
// LLMParamConfig, ConfigLoadUtil, PanguDevSDKException, HttpUtil, SecurityUtil,
// StreamHelper, PanguChatChunk, PanguChatResp, and the inherited streamCallBack
// field come from the Pangu dev SDK; import them from your SDK version's packages.

@Slf4j
public class CustomLLM extends AbstractLLM<LLMResp> {
    /**
     * Constructor.
     *
     * @param llmConfig LLM configuration
     */
    public CustomLLM(LLMConfig llmConfig) {
        super(llmConfig);
    }
    @Override
    protected LLMResp getLLMResponse(List<ConversationMessage> chatMessages, LLMParamConfig llmParamConfig) {
        // Build the request body
        Map<String, Object> request = new HashMap<>();
        request.put("temperature", 0.3);
        request.put("data", chatMessages.stream().map(ConversationMessage::getContent).collect(Collectors.toList()));
        final String requestBody = JSON.toJSONString(request);
        log.info("request body : \n{}", JSON.toJSONString(JSON.parseObject(requestBody), true));
        // Read the URL from configuration and build the POST request
        String url = ConfigLoadUtil.getStringConf(null, "llm.custom.api.url");
        if (StringUtils.isEmpty(url)) {
            throw new PanguDevSDKException("the llm.custom.api.url is not config");
        }
        HttpPost httpPost = new HttpPost(url);
        httpPost.setEntity(new StringEntity(requestBody, ContentType.APPLICATION_JSON));
        // Send the request and parse the response
        if (llmConfig.getLlmParamConfig().isStream()) {
            // Streaming: collect chunks asynchronously, then merge them into one answer
            httpPost.setHeader(new BasicHeader("Inference-Type", "stream"));
            try (CloseableHttpAsyncClient httpclient = HttpUtil.getHttpAsyncClient(false)) {
                httpclient.start();
                final String callBackId = SecurityUtil.getUUID();
                final List<PanguChatChunk> panguChatChunks = new ArrayList<>();
                Future<HttpResponse> future = httpclient.execute(HttpAsyncMethods.create(httpPost),
                    StreamHelper.getAsyncConsumer(streamCallBack, callBackId, panguChatChunks),
                    StreamHelper.getCallBack(streamCallBack, callBackId, httpPost));
                // Wait for the stream to finish (default timeout: 300 seconds)
                future.get(Optional.ofNullable(llmConfig.getHttpConfig().getAsyncHttpWaitSeconds()).orElse(300),
                    TimeUnit.SECONDS);
                // Merge the received chunks into a single response
                final PanguChatResp allRespFromChunk = StreamHelper.getAllRespFromChunk(panguChatChunks);
                return LLMResp.builder().answer(allRespFromChunk.getChoices().get(0).getMessage().getContent()).build();
            } catch (Exception e) {
                throw new PanguDevSDKException(e);
            }
        } else {
            // Non-streaming: send a synchronous request and read the full body
            try (CloseableHttpClient httpClient = HttpUtil.getHttpClient(false);
                 CloseableHttpResponse response = httpClient.execute(httpPost)) {
                final String responseStr = EntityUtils.toString(response.getEntity(), StandardCharsets.UTF_8);
                log.info("response: \n{}", JSON.toJSONString(JSON.parseObject(responseStr), true));
                // Parse the result
                final JSONObject jsonObject = JSON.parseObject(responseStr);
                JSONObject result = jsonObject.getJSONObject("result");
                if (result == null) {
                    result = jsonObject;
                }
                final String content = result.getJSONArray("answers").getJSONObject(0).getString("content");
                return LLMResp.builder().answer(content).build();
            } catch (IOException e) {
                throw new PanguDevSDKException(e);
            }
        }
    }
    /**
     * Rebuild the response from a cached answer string.
     */
    @Override
    protected LLMResp getLLMResponseFromCache(String cache) {
        return LLMResp.builder().answer(cache).isFromCache(true).build();
    }
}
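
Once the class is defined, the custom model can be invoked like any other model in the SDK. The snippet below is a minimal usage sketch, not a documented SDK API: it assumes AbstractLLM exposes an ask(String) entry point and LLMResp a getAnswer() accessor (as the SDK's built-in models do), and it requires llm.custom.api.url to be set in the configuration file beforehand:

// Minimal usage sketch; construct the model with a default configuration
// (assumes LLMConfig has a usable no-arg constructor; adjust as needed).
LLMConfig llmConfig = new LLMConfig();
CustomLLM llm = new CustomLLM(llmConfig);
LLMResp resp = llm.ask("Introduce yourself.");   // sample prompt for illustration
System.out.println(resp.getAnswer());

To receive streaming output instead, enable stream in LLMParamConfig before constructing the model; the getLLMResponse implementation above switches to the asynchronous path when llmConfig.getLlmParamConfig().isStream() is true.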
