levinxo issues

Results 3 issues of


                                            levinxo

使用最新的go1.13版本进行编译后运行提示Failed to list *v1alpha1.TrainingJob

项目目前提供的老版本镜像 https://hub.docker.com/r/tizhou86/paddle-on-k8s-operator 里强制限制了 master pod里etcd 容器的镜像（镜像为m3ngyang/etcd:v3.2.1）拉取策略为Always，在无互联网的集群中operator无法完全跑起来。所以对master分支的代码进行修改，将etcd镜像拉取策略由Always改为IfNotPresent，重新build： go version: 1.13.6 linux/amd64 go build -o paddle-on-k8s-operator ./cmd/operator 镜像Dockerfile和项目提供的Dockerfile保持一致： FROM ubuntu:18.04 ADD paddle-on-k8s-operator /usr/local/bin ENTRYPOINT ["/usr/local/bin/paddle-on-k8s-operator"] 部署operator后，错误日志如下： E0413 10:36:30.127845 1 reflector.go:205] pkg/mod/k8s.io/[email protected]+incompatible/tools/cache/reflector.go:99:...

添加add_special_tokens选项，默认true，支持chatglm模型

如题，默认为true，不影响目前chatglm的推理逻辑，为false后，将去除chatglm的special token。请帮忙review，感谢~

fastllm是否支持使用bitsandbytes量化的chatglm3-6b-base int4模型

目前使用torch2flm脚本转换chatglm3-6b-base int4模型，会产生20多G的.flm文件。请问是否支持直接将int4量化的chatglm3-6b-base模型转为flm文件并推理？