文档首页/ 推荐系统 RES/ API参考/ API/ 训练作业/ 新建训练作业

更新时间：2022-05-13 GMT+08:00

查看PDF

新建训练作业

功能介绍

新建训练作业元数据，新建成功之后可手动执行此任务。

调试

您可以在API Explorer中调试该接口。

URI

POST /v2.0/{project_id}/workspaces/{workspace_id}/resources/{resource_id}/job-instance

表1 路径参数
参数	是否必选	参数类型	描述
project_id	是	String	项目id，获取方式请参见获取项目ID
resource_id	是	String	资源id
workspace_id	是	String	工作空间id

请求参数

表2 请求Header参数
参数	是否必选	参数类型	描述
Content-Type	是	String	内容类型，取值为application/json
X-Auth-Token	是	String	用户token，获取方式请参见获取用户Token

表3 请求Body参数
参数	是否必选	参数类型	描述
category	是	String	类别： RECALL，召回作业 DATASOURCE，数据源作业 FILTER，过滤作业 SORTING，排序作业 EVALUATE，效果评估作业
description	否	String	描述
job_config	是	jobConfig object	作业配置
exec_config	否	ResExecConfig object	作业执行配置项（仅离线任务可提供此配置）
job_name	是	String	作业名称，1-64位字母、数字、下划线、中划线组合最小长度：1 最大长度：64
job_type	是	String	作业类型： WriteUserProfile，用户画像实时导入作业 WriteItemProfile，物品画像实时导入作业 UniversalProcess，行为数据实时导入作业 NearlineRecall，近线召回作业 EncodeProfile，近线特征工程作业 AttributeMatch，属性匹配召回作业 AlsCF，交替最小二乘协同过滤作业 BhvHistory，历史行为记忆召回作业 ItemCf，物品协同过滤召回作业 MenEdit，人工配置候选集作业 Ucb，UCB召回作业 UserCf，用户协同过滤召回作业 WeightBehavior，综合行为热度召回作业 Filter，历史行为过滤作业 AutoPreRank，智能ETL参数生成作业 ETL，离线特征工程作业 LR，LR作业 DEEPFM，DEEPFM作业 AutoGroup，AutoGroup作业 StreamRank，在线训练作业 DataStruct，识别数据结构作业 DataExploration，数据探索作业 DataImport，离线数据导入作业 Evaluate，效果评估作业
schedule	否	String	调度参数

表4 jobConfig
参数	是否必选	参数类型	描述
nearline_recall_param	否	NearLineRecallParam object	近线召回作业参数（近线召回作业需要提供此参数）。
max_recommended_num	否	Integer	最大候选集个数（所有召回作业需提供此参数）。最小值：1 最大值：200
match_feature_pairs	否	Array of MatchFeaturePair objects	匹配特征对（属性匹配召回作业需要提供此参数）。
striping	否	Striping object	行条化策略（属性匹配召回作业、物品协同过滤召回作业、用户协同过滤召回作业需要提供此参数）。
match_type	否	String	匹配类型（属性匹配召回作业需提供此参数）： UI，基于用户推荐物品 UU，基于用户推荐用户 II，基于物品推荐物品 IU，基于物品推荐用户
matrix_factorization	否	MatrixFactorization object	矩阵分解参数配置（交替最小二乘协同过滤作业需提供此参数）。
behavior_frequencys	否	Array of BehaviorFrequency objects	行为频率信息（历史行为记忆召回作业、历史行为过滤作业需提供此参数）。
file_path	否	String	文件路径（人工配置候选集作业需要提供此参数）。
ucb_param	否	UcbParam object	UCB作业参数（UCB召回作业需要提供此参数）。
behavior_gravity	否	BehaviorGravity object	重力衰减因子（综合行为热度召回作业需要提供此参数）。
category	否	Category object	类别（综合行为热度召回作业需要提供此参数）。
behavior_logic	否	String	行为逻辑过滤（历史行为过滤作业需提供此参数）： AND，同时满足则过滤 OR，满足一个则过滤
features_engineering	否	EtlBasicParameter object	特征参数（离线特征工程作业需要提供此参数）。
sample_param	否	SampleParam object	样本参数（离线特征工程作业需要提供此参数）。
deep_learning_parameters	否	DeepLearingParam object	排序作业通用参数（LR、DEEPFM、AutoGroup需要提供此参数）。
algorithm_specify_parameters	否	AlgorithmSpecifyParameters object	排序算法特定参数（LR、DEEPFM、AutoGroup需要提供此参数）。
load_widetable	否	Boolean	导入宽表（离线数据导入作业需要提供此参数）。
load_profile	否	Boolean	导入画像（离线数据导入作业需要提供此参数）。
save_mode	否	String	保留已有宽表（离线数据导入作业需要提供此参数）： append，是 new，否 overwirte，覆盖
indicators	否	Array of Indicator objects	统计指标（效果评估作业需要提供此参数）。
offline_rank_job_name	否	String	离线排序作业名称（在线训练任务需要提供此参数）。
update_interval	否	Integer	更新周期（在线训练任务需要提供此参数）。
optimizer	否	Optimizer object	优化器（在线训练任务需要提供此参数）。
flows	否	Flow object	在线流程（在线训练任务需要提供此参数）。

表5 NearLineRecallParam
参数	是否必选	参数类型	描述
time_limit	否	Boolean	时间过滤。
timeFeature	否	String	时间特征。
retainDays	否	Integer	保留期(天)。
recall_fileds	否	Array of RecallFiled objects	召回字段。
itemCF_job_name	否	String	物品协同过滤作业名称。

表6 RecallFiled
参数	是否必选	参数类型	描述
name	否	String	字段名称。
value	否	Integer	使用字段值的个数。最小值：1 最大值：10 缺省值：1

表7 MatchFeaturePair
参数	是否必选	参数类型	描述
user_feature_name	否	String	用户特征。
item_feature_name	否	String	物品特征。
weight	否	Double	权重。
match_count	否	Boolean	匹配个数度量。

表8 Striping
参数	是否必选	参数类型	描述
nearest_neighborhood	是	Integer	最近领域个数。
band	是	Integer	相似程度。最小值：1 最大值：20
row	是	Integer	相似距离。最小值：1 最大值：10

表9 MatrixFactorization
参数	是否必选	参数类型	描述
implicit_vector_rank	是	Integer	隐向量维度。最小值：1 最大值：100
regular_param	是	Double	优化正则化系数。最小值：1.0E-8 最大值：1
max_iterator_num	是	Integer	迭代次数。最小值：1 最大值：50

**表10** BehaviorFrequency
参数	是否必选	参数类型	描述
behavior_type	是	String	行为类型： view，曝光 click，点击 collect，收藏 uncollect，取消收藏 search_click，搜索后点击 comment，评论 share，分享 like，点赞 dislike，点衰 grade，评分 consume，消费 use，观看视频/听音乐/阅读 download，下载 tip，打赏 subscribe，关注
lower_limit	否	Integer	最小次数。最小值：1
upper_limit	否	Integer	最大次数。最小值：1
time_interval	是	Integer	时间区间。最小值：1

**表11** UcbParam
参数	是否必选	参数类型	描述
alpha	是	Double	折中参数。最小值：0 最大值：1
min_used_num	是	Integer	最小行为次数。最小值：30 最大值：1000

**表12** BehaviorGravity
参数	是否必选	参数类型	描述
weaken_factor	否	Double	衰减因子。最小值：0.1 最大值：5
view_type	否	String	行为次数统计方法： pv，访问量 uv，独立访客
algo_type	否	String	算法类型: normal，通用 time，时间

**表13** Category
参数	是否必选	参数类型	描述
user_meta_list	否	Array of strings	用户特征。
item_meta_list	否	Array of strings	物品特征。

**表14** EtlBasicParameter
参数	是否必选	参数类型	描述
user_features	否	Array of FeatureTransformation objects	用户特征。
item_features	否	Array of FeatureTransformation objects	物品特征。
rank_etl_filter	否	RankETLFilter object	过滤参数。

**表15** FeatureTransformation
参数	是否必选	参数类型	描述
attr	否	Attribute object	特征。
discrete_method	否	String	离散方法： equal_distance_discrete，等距离散 user_define_discrete，自定义离散 normalize，归一化 null，不离散
params	否	Object	具体处理参数。

**表16** Attribute
参数	是否必选	参数类型	描述
name	是	String	名称。
data_type	否	String	数据类型。
other_uses	否	Array of strings	其他用途。

**表17** RankETLFilter
参数	是否必选	参数类型	描述
filter_type	是	String	行为去重方式： abs_weight，权重绝对值 date，日期
time_type	是	String	时间类型： day，天 week，周 month，月
is_monday_first	否	Boolean	周一是否是第一天。

**表18** SampleParam
参数	是否必选	参数类型	描述
divide_type	是	String	训练集测试集划分方式： TIME，时间比例 RAMDOM，个数比例
train_rate	否	Double	训练数据占比。最小值：0.01 最大值：1
test_rate	否	Double	测试数据占比。最小值：0.01 最大值：1

**表19** DeepLearingParam
参数	是否必选	参数类型	描述
initial_parameters	否	Initial object	初始化参数。
optimize_parameters	否	Optimizer object	优化参数。
regular_parameters	否	Regular object	正则化参数。
max_iterations	否	Integer	最大迭代轮数。最小值：1 最大值：1000
early_stop_iterations	否	Integer	提前终止训练轮数。最小值：1 最大值：1000
batch_size	否	Integer	批量大小。最小值：1
dataset_split_parts	否	Integer	训练数据集切分数量。最小值：1 最大值：10
restart_train	否	Boolean	重新训练。

**表20** Initial
参数	是否必选	参数类型	描述
initial_method	是	String	初始化方法。
mean_value	否	Double	平均值。最小值：-1 最大值：1
standard_deviation	否	Double	标准差。最小值：0 最大值：1
min_value	否	Double	最小值。最小值：-1 最大值：0
max_value	否	Double	最大值。最小值：0 最大值：1

**表21** Optimizer
参数	是否必选	参数类型	描述
type	否	String	优化器类型。
learning_rate	否	Double	学习率。
initial_accumulator_value	否	Double	初始梯度累加和。最小值：0 最大值：1
lambda1	否	Double	L1正则项系数。最小值：0 最大值：1
lambda2	否	Double	L2正则项系数。最小值：0 最大值：1
epsilon	否	Double	数值稳定常量。最小值：0 最大值：1
decay_rate	否	Double	衰减因子。最小值：0 最大值：1
decay_steps	否	Double	衰减步长。最小值：1

**表22** Regular
参数	是否必选	参数类型	描述
l2_regularization	否	Double	L2正则项系数。最小值：0 最大值：1
regular_loss_compute_mode	否	String	正则损失计算方式。
embed_l2_regularization	否	Double	隐向量层L2正则化系数。最小值：0 最大值：1
wide_l2_regularization	否	Double	wide部分L2正则化系数。最小值：0 最大值：1
structure_l2_regularization	否	Double	结构化部分L2正则化系数。最小值：0 最大值：1

**表23** AlgorithmSpecifyParameters
参数	是否必选	参数类型	描述
latent_vector_length	否	Integer	隐向量长度(DEEPFM需要提供此参数)。最小值：1 最大值：100
architecture	否	Array of integers	神经网络结构(DEEPFM需要提供此参数)。
active_function	否	String	激活函数(DEEPFM需要提供此参数,AutoGroup需要提供此参数)。
value_keep_probability	否	Double	神经元值保留概率(DEEPFM需要提供此参数,AutoGroup需要提供此参数)。最小值：0 最大值：1
embed_size	否	Array of integers	各阶隐向量长度(AutoGroup需要提供此参数)。
mlp_architecture	否	Array of integers	神经网络结构(AutoGroup需要提供此参数)。
max_order	否	Integer	最大交互阶数(AutoGroup需要提供此参数)。
hash_sizes	否	Array of integers	哈希长度(AutoGroup需要提供此参数)。
hash_compensation	否	Array of numbers	特征交互层惩罚项系数(AutoGroup需要提供此参数)。
use_wide_part	否	Boolean	使用线性部分(AutoGroup需要提供此参数)。
structure_optimizer	否	Optimizer object	优化器参数(AutoGroup需要提供此参数)。
merge_multi_hot	否	Boolean	融合多值特征(AutoGroup需要提供此参数)。
fix_structure	否	Boolean	固定哈希结构(AutoGroup需要提供此参数)。

**表24** Indicator
参数	是否必选	参数类型	描述
indicator_name	否	String	指标名称： clickPVRate，点击PV率 clickUVRate，点击UV率 customize，自定义
indicator_params	否	IndicatorParam object	指标参数（自定义指标需要提供）。

**表25** IndicatorParam
参数	是否必选	参数类型	描述
customize_parameter	否	CustomizeParameter object	自定义参数。
customize_formula	否	CustomizeFormula object	自定义公式。

**表26** CustomizeParameter
参数	是否必选	参数类型	描述
alias	是	String	别名。
behavior_type	是	String	行为类型。
threshold	否	Double	阈值。最小值：0 最大值：1
deduplication	是	String	去重。

**表27** CustomizeFormula
参数	是否必选	参数类型	描述
alias	否	String	别名。
formula	否	String	公式。

**表28** Flow
参数	是否必选	参数类型	描述
flow_id	是	String	流程id。
attr_pair_rules_filter	否	Array of AttrPairRules objects	属性对过滤。
attr_pair_rules_reserve	否	Array of AttrPairRules objects	属性对保留。
deduplication_list	否	Array of Deduplication objects	属性去重。
attribute_info	否	AttributeInfo object	综合排序信息。
bloom_filter_conf	否	BloomFilterConf object	布隆过滤器配置。
group_attr	否	String	分组打散属性。
pre_deal	否	Boolean	在排序前去重。
rank_setting	否	String	排序配置信息。
rules	否	Rule object	候选集融合。
filter_sets	否	Array of strings	过滤配置信息。
attr_value_rules_filter	否	Array of AttrValueRules objects	属性值过滤。
attr_value_rules_reserve	否	Array of AttrValueRules objects	属性值保留。
ctr_job	否	String	排序作业（使用点击率预估时需要提供此参数）。
ratio	否	Integer	流量占比。最小值：1 最大值：100
toppings	否	Array of strings	需要置顶的候选集列表。

**表29** AttrPairRules
参数	是否必选	参数类型	描述
attr_pairs	否	Array of AttrPair objects	属性对。

**表30** AttrPair
参数	是否必选	参数类型	描述
party_a	否	String	被推荐对象的属性名。
party_b	否	String	被推荐对象的属性名。

**表31** Deduplication
参数	是否必选	参数类型	描述
attributes	否	Array of strings	属性。

**表32** AttributeInfo
参数	是否必选	参数类型	描述
rank_feature_pairs	否	Array of RankFeaturePair objects	属性匹配对。
numerical_attrs	否	Array of NumericalAttr objects	属性权重。
num_statistics_type	否	String	统计方式： ORDER，顺序 ABS，绝对值

**表33** RankFeaturePair
参数	是否必选	参数类型	描述
feature_name_a	否	String	待推荐对象的属性。
feature_name_b	否	String	被推荐对象的属性。
weight	否	Float	权重。最小值：0.01 最大值：1

**表34** NumericalAttr
参数	是否必选	参数类型	描述
name	是	String	特征名。
weight	是	Float	权重。最小值：0.001 最大值：1

**表35** BloomFilterConf
参数	是否必选	参数类型	描述
behaviors	否	Array of strings	待过滤行为类型。
interval	否	Integer	过滤时间。最小值：1 最大值：7

**表36** Rule
参数	是否必选	参数类型	描述
table_name	是	String	候选集表名。
rule_ratio	是	Integer	规则占比。最小值：1 最大值：100
priority	是	Integer	优先级。最小值：1 最大值：10

**表37** AttrValueRules
参数	是否必选	参数类型	描述
attr_values_a	否	Array of AttrValue objects	被推荐对象的属性-值配置。
attr_values_b	是	Array of AttrValue objects	待推荐对象的属性-值配置。

**表38** AttrValue
参数	是否必选	参数类型	描述
name	是	String	属性名称。
value	是	String	属性值。

**表39** ResExecConfig
参数	是否必选	参数类型	描述
spark_calc_spec	否	SparkCalcSpec object	spark自定义计算规格
spark_option_confs	否	Array of SparkOptionConf objects	spark可选配置项

**表40** SparkCalcSpec
参数	是否必选	参数类型	描述
driver_memory	是	String	driver内存
driver_cores	是	Integer	driver核数
executor_memory	是	String	executor内存
executor_cores	是	Integer	executor核数
num_executors	是	Integer	executor个数

**表41** SparkOptionConf
参数	是否必选	参数类型	描述
name	否	String	参数名称
value	否	String	参数值

响应参数

状态码： 200

**表42** 响应Body参数
参数	参数类型	描述
is_success	Boolean	是否成功
job	jobs object	作业信息
message	String	返回消息（请求成功时，不返回此字段）
error_code	String	错误码（请求成功时，不返回此字段）

**表43** jobs
参数	参数类型	描述
category	String	类别。
config_info	String	配置信息。
description	String	描述。
job_id	String	作业id。
job_name	String	作业名称。
job_type	String	作业类型。
next_schedule_time	Integer	下次调度时间。
platform	String	平台。
resource_id	String	资源id。
schedule	String	调度参数。
status	String	状态。
workspace_id	String	工作空间id。
job_config	jobConfig object	作业配置。

**表44** jobConfig
参数	参数类型	描述
nearline_recall_param	NearLineRecallParam object	近线召回作业参数（近线召回作业需要提供此参数）。
max_recommended_num	Integer	最大候选集个数（所有召回作业需提供此参数）。最小值：1 最大值：200
match_feature_pairs	Array of MatchFeaturePair objects	匹配特征对（属性匹配召回作业需要提供此参数）。
striping	Striping object	行条化策略（属性匹配召回作业、物品协同过滤召回作业、用户协同过滤召回作业需要提供此参数）。
match_type	String	匹配类型（属性匹配召回作业需提供此参数）： UI，基于用户推荐物品 UU，基于用户推荐用户 II，基于物品推荐物品 IU，基于物品推荐用户
matrix_factorization	MatrixFactorization object	矩阵分解参数配置（交替最小二乘协同过滤作业需提供此参数）。
behavior_frequencys	Array of BehaviorFrequency objects	行为频率信息（历史行为记忆召回作业、历史行为过滤作业需提供此参数）。
file_path	String	文件路径（人工配置候选集作业需要提供此参数）。
ucb_param	UcbParam object	UCB作业参数（UCB召回作业需要提供此参数）。
behavior_gravity	BehaviorGravity object	重力衰减因子（综合行为热度召回作业需要提供此参数）。
category	Category object	类别（综合行为热度召回作业需要提供此参数）。
behavior_logic	String	行为逻辑过滤（历史行为过滤作业需提供此参数）： AND，同时满足则过滤 OR，满足一个则过滤
features_engineering	EtlBasicParameter object	特征参数（离线特征工程作业需要提供此参数）。
sample_param	SampleParam object	样本参数（离线特征工程作业需要提供此参数）。
deep_learning_parameters	DeepLearingParam object	排序作业通用参数（LR、DEEPFM、AutoGroup需要提供此参数）。
algorithm_specify_parameters	AlgorithmSpecifyParameters object	排序算法特定参数（LR、DEEPFM、AutoGroup需要提供此参数）。
load_widetable	Boolean	导入宽表（离线数据导入作业需要提供此参数）。
load_profile	Boolean	导入画像（离线数据导入作业需要提供此参数）。
save_mode	String	保留已有宽表（离线数据导入作业需要提供此参数）： append，是 new，否 overwirte，覆盖
indicators	Array of Indicator objects	统计指标（效果评估作业需要提供此参数）。
offline_rank_job_name	String	离线排序作业名称（在线训练任务需要提供此参数）。
update_interval	Integer	更新周期（在线训练任务需要提供此参数）。
optimizer	Optimizer object	优化器（在线训练任务需要提供此参数）。
flows	Flow object	在线流程（在线训练任务需要提供此参数）。

**表45** NearLineRecallParam
参数	参数类型	描述
time_limit	Boolean	时间过滤。
timeFeature	String	时间特征。
retainDays	Integer	保留期(天)。
recall_fileds	Array of RecallFiled objects	召回字段。
itemCF_job_name	String	物品协同过滤作业名称。

**表46** RecallFiled
参数	参数类型	描述
name	String	字段名称。
value	Integer	使用字段值的个数。最小值：1 最大值：10 缺省值：1

**表47** MatchFeaturePair
参数	参数类型	描述
user_feature_name	String	用户特征。
item_feature_name	String	物品特征。
weight	Double	权重。
match_count	Boolean	匹配个数度量。

**表48** Striping
参数	参数类型	描述
nearest_neighborhood	Integer	最近领域个数。
band	Integer	相似程度。最小值：1 最大值：20
row	Integer	相似距离。最小值：1 最大值：10

**表49** MatrixFactorization
参数	参数类型	描述
implicit_vector_rank	Integer	隐向量维度。最小值：1 最大值：100
regular_param	Double	优化正则化系数。最小值：1.0E-8 最大值：1
max_iterator_num	Integer	迭代次数。最小值：1 最大值：50

**表50** BehaviorFrequency
参数	参数类型	描述
behavior_type	String	行为类型： view，曝光 click，点击 collect，收藏 uncollect，取消收藏 search_click，搜索后点击 comment，评论 share，分享 like，点赞 dislike，点衰 grade，评分 consume，消费 use，观看视频/听音乐/阅读 download，下载 tip，打赏 subscribe，关注
lower_limit	Integer	最小次数。最小值：1
upper_limit	Integer	最大次数。最小值：1
time_interval	Integer	时间区间。最小值：1

**表51** UcbParam
参数	参数类型	描述
alpha	Double	折中参数。最小值：0 最大值：1
min_used_num	Integer	最小行为次数。最小值：30 最大值：1000

**表52** BehaviorGravity
参数	参数类型	描述
weaken_factor	Double	衰减因子。最小值：0.1 最大值：5
view_type	String	行为次数统计方法： pv，访问量 uv，独立访客
algo_type	String	算法类型: normal，通用 time，时间

**表53** Category
参数	参数类型	描述
user_meta_list	Array of strings	用户特征。
item_meta_list	Array of strings	物品特征。

**表54** EtlBasicParameter
参数	参数类型	描述
user_features	Array of FeatureTransformation objects	用户特征。
item_features	Array of FeatureTransformation objects	物品特征。
rank_etl_filter	RankETLFilter object	过滤参数。

**表55** FeatureTransformation
参数	参数类型	描述
attr	Attribute object	特征。
discrete_method	String	离散方法： equal_distance_discrete，等距离散 user_define_discrete，自定义离散 normalize，归一化 null，不离散
params	Object	具体处理参数。

**表56** Attribute
参数	参数类型	描述
name	String	名称。
data_type	String	数据类型。
other_uses	Array of strings	其他用途。

**表57** RankETLFilter
参数	参数类型	描述
filter_type	String	行为去重方式： abs_weight，权重绝对值 date，日期
time_type	String	时间类型： day，天 week，周 month，月
is_monday_first	Boolean	周一是否是第一天。

**表58** SampleParam
参数	参数类型	描述
divide_type	String	训练集测试集划分方式： TIME，时间比例 RAMDOM，个数比例
train_rate	Double	训练数据占比。最小值：0.01 最大值：1
test_rate	Double	测试数据占比。最小值：0.01 最大值：1

**表59** DeepLearingParam
参数	参数类型	描述
initial_parameters	Initial object	初始化参数。
optimize_parameters	Optimizer object	优化参数。
regular_parameters	Regular object	正则化参数。
max_iterations	Integer	最大迭代轮数。最小值：1 最大值：1000
early_stop_iterations	Integer	提前终止训练轮数。最小值：1 最大值：1000
batch_size	Integer	批量大小。最小值：1
dataset_split_parts	Integer	训练数据集切分数量。最小值：1 最大值：10
restart_train	Boolean	重新训练。

**表60** Initial
参数	参数类型	描述
initial_method	String	初始化方法。
mean_value	Double	平均值。最小值：-1 最大值：1
standard_deviation	Double	标准差。最小值：0 最大值：1
min_value	Double	最小值。最小值：-1 最大值：0
max_value	Double	最大值。最小值：0 最大值：1

**表61** Optimizer
参数	参数类型	描述
type	String	优化器类型。
learning_rate	Double	学习率。
initial_accumulator_value	Double	初始梯度累加和。最小值：0 最大值：1
lambda1	Double	L1正则项系数。最小值：0 最大值：1
lambda2	Double	L2正则项系数。最小值：0 最大值：1
epsilon	Double	数值稳定常量。最小值：0 最大值：1
decay_rate	Double	衰减因子。最小值：0 最大值：1
decay_steps	Double	衰减步长。最小值：1

**表62** Regular
参数	参数类型	描述
l2_regularization	Double	L2正则项系数。最小值：0 最大值：1
regular_loss_compute_mode	String	正则损失计算方式。
embed_l2_regularization	Double	隐向量层L2正则化系数。最小值：0 最大值：1
wide_l2_regularization	Double	wide部分L2正则化系数。最小值：0 最大值：1
structure_l2_regularization	Double	结构化部分L2正则化系数。最小值：0 最大值：1

**表63** AlgorithmSpecifyParameters
参数	参数类型	描述
latent_vector_length	Integer	隐向量长度(DEEPFM需要提供此参数)。最小值：1 最大值：100
architecture	Array of integers	神经网络结构(DEEPFM需要提供此参数)。
active_function	String	激活函数(DEEPFM需要提供此参数,AutoGroup需要提供此参数)。
value_keep_probability	Double	神经元值保留概率(DEEPFM需要提供此参数,AutoGroup需要提供此参数)。最小值：0 最大值：1
embed_size	Array of integers	各阶隐向量长度(AutoGroup需要提供此参数)。
mlp_architecture	Array of integers	神经网络结构(AutoGroup需要提供此参数)。
max_order	Integer	最大交互阶数(AutoGroup需要提供此参数)。
hash_sizes	Array of integers	哈希长度(AutoGroup需要提供此参数)。
hash_compensation	Array of numbers	特征交互层惩罚项系数(AutoGroup需要提供此参数)。
use_wide_part	Boolean	使用线性部分(AutoGroup需要提供此参数)。
structure_optimizer	Optimizer object	优化器参数(AutoGroup需要提供此参数)。
merge_multi_hot	Boolean	融合多值特征(AutoGroup需要提供此参数)。
fix_structure	Boolean	固定哈希结构(AutoGroup需要提供此参数)。

**表64** Indicator
参数	参数类型	描述
indicator_name	String	指标名称： clickPVRate，点击PV率 clickUVRate，点击UV率 customize，自定义
indicator_params	IndicatorParam object	指标参数（自定义指标需要提供）。

**表65** IndicatorParam
参数	参数类型	描述
customize_parameter	CustomizeParameter object	自定义参数。
customize_formula	CustomizeFormula object	自定义公式。

**表66** CustomizeParameter
参数	参数类型	描述
alias	String	别名。
behavior_type	String	行为类型。
threshold	Double	阈值。最小值：0 最大值：1
deduplication	String	去重。

**表67** CustomizeFormula
参数	参数类型	描述
alias	String	别名。
formula	String	公式。

**表68** Flow
参数	参数类型	描述
flow_id	String	流程id。
attr_pair_rules_filter	Array of AttrPairRules objects	属性对过滤。
attr_pair_rules_reserve	Array of AttrPairRules objects	属性对保留。
deduplication_list	Array of Deduplication objects	属性去重。
attribute_info	AttributeInfo object	综合排序信息。
bloom_filter_conf	BloomFilterConf object	布隆过滤器配置。
group_attr	String	分组打散属性。
pre_deal	Boolean	在排序前去重。
rank_setting	String	排序配置信息。
rules	Rule object	候选集融合。
filter_sets	Array of strings	过滤配置信息。
attr_value_rules_filter	Array of AttrValueRules objects	属性值过滤。
attr_value_rules_reserve	Array of AttrValueRules objects	属性值保留。
ctr_job	String	排序作业（使用点击率预估时需要提供此参数）。
ratio	Integer	流量占比。最小值：1 最大值：100
toppings	Array of strings	需要置顶的候选集列表。

**表69** AttrPairRules
参数	参数类型	描述
attr_pairs	Array of AttrPair objects	属性对。

**表70** AttrPair
参数	参数类型	描述
party_a	String	被推荐对象的属性名。
party_b	String	被推荐对象的属性名。

**表71** Deduplication
参数	参数类型	描述
attributes	Array of strings	属性。

**表72** AttributeInfo
参数	参数类型	描述
rank_feature_pairs	Array of RankFeaturePair objects	属性匹配对。
numerical_attrs	Array of NumericalAttr objects	属性权重。
num_statistics_type	String	统计方式： ORDER，顺序 ABS，绝对值

**表73** RankFeaturePair
参数	参数类型	描述
feature_name_a	String	待推荐对象的属性。
feature_name_b	String	被推荐对象的属性。
weight	Float	权重。最小值：0.01 最大值：1

**表74** NumericalAttr
参数	参数类型	描述
name	String	特征名。
weight	Float	权重。最小值：0.001 最大值：1

**表75** BloomFilterConf
参数	参数类型	描述
behaviors	Array of strings	待过滤行为类型。
interval	Integer	过滤时间。最小值：1 最大值：7

**表76** Rule
参数	参数类型	描述
table_name	String	候选集表名。
rule_ratio	Integer	规则占比。最小值：1 最大值：100
priority	Integer	优先级。最小值：1 最大值：10

**表77** AttrValueRules
参数	参数类型	描述
attr_values_a	Array of AttrValue objects	被推荐对象的属性-值配置。
attr_values_b	Array of AttrValue objects	待推荐对象的属性-值配置。

**表78** AttrValue
参数	参数类型	描述
name	String	属性名称。
value	String	属性值。

请求示例

物品协同过滤召回作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "itemcf-test",
  "description" : "itemcf",
  "category" : "RECALL",
  "job_type" : "ItemCf",
  "job_config" : {
    "striping" : {
      "nearest_neighborhood" : 50,
      "band" : 4,
      "row" : 5
    },
    "max_recommended_num" : 10
  }
}

智能ETL参数生成作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "AutoPreRank-test",
  "description" : "AutoPreRank",
  "category" : "SORTING",
  "job_type" : "AutoPreRank",
  "job_config" : { }
}

属性匹配召回作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "attribute-test",
  "description" : "attribute",
  "category" : "RECALL",
  "job_type" : "AttributeMatch",
  "job_config" : {
    "match_feature_pairs" : [ {
      "user_feature_name" : "tags",
      "item_feature_name" : "tags",
      "weight" : 1
    } ],
    "striping" : {
      "nearest_neighborhood" : 50,
      "band" : 4,
      "row" : 5
    },
    "max_recommended_num" : 10
  }
}

离线特征工程作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "ETL-test",
  "description" : "ETL",
  "category" : "SORTING",
  "job_type" : "ETL",
  "job_config" : {
    "features_engineering" : {
      "user_features" : [ {
        "attr" : {
          "name" : "age",
          "data_type" : "long"
        },
        "discrete_method" : "user_define_discrete",
        "params" : {
          "discrete_periods" : [ 1, 31, 52, 73, 94, 100 ]
        }
      }, {
        "attr" : {
          "name" : "gender",
          "data_type" : "string"
        },
        "params" : { }
      }, {
        "attr" : {
          "name" : "tags",
          "data_type" : "strArray"
        },
        "params" : {
          "value_preserve_number" : 3
        }
      }, {
        "attr" : {
          "name" : "extend_float",
          "data_type" : "float"
        },
        "discrete_method" : "normalize",
        "params" : {
          "lower_limit" : 1,
          "upper_limit" : 10
        }
      } ],
      "item_features" : [ {
        "attr" : {
          "name" : "extend_float",
          "data_type" : "float"
        },
        "discrete_method" : "null"
      }, {
        "attr" : {
          "name" : "extend_string",
          "data_type" : "string"
        },
        "params" : { }
      }, {
        "attr" : {
          "name" : "extend_strArray",
          "data_type" : "strArray"
        },
        "params" : {
          "value_preserve_number" : 3
        }
      } ],
      "rank_etl_filter" : {
        "filter_type" : "date",
        "time_type" : "day",
        "is_monday_first" : true
      }
    },
    "sample_param" : {
      "divide_type" : "TIME",
      "train_rate" : 0.7,
      "test_rate" : 0.3
    }
  }
}

近线召回作业

/v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/instance

{
  "job_name" : "NearlineRecall-test",
  "description" : "nearlinerecall",
  "category" : "RECALL",
  "job_type" : "NearlineRecall",
  "job_config" : {
    "nearline_recall_param" : {
      "time_limit" : true,
      "time_feature" : "publishTime",
      "retain_days" : 2,
      "recall_fileds" : [ {
        "name" : "author",
        "value" : 1
      }, {
        "name" : "category",
        "value" : 1
      }, {
        "name" : "tags",
        "value" : 3
      } ],
      "max_recommended_num" : 100
    }
  }
}

离线数据导入作业

/v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx//workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx//resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx//job-instance

{
  "job_name" : "dataimport-test",
  "description" : "dataimport",
  "category" : "DATASOURCE",
  "job_type" : "DataImport",
  "job_config" : {
    "load_widetable" : true,
    "load_profile" : true
  }
}

物品画像实时导入作业

/v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/instance

{
  "job_name" : "WriteItemProfile-test",
  "description" : "item",
  "category" : "DATASOURCE",
  "job_type" : "WriteItemProfile",
  "job_config" : { }
}

历史行为过滤作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "Filter-test",
  "description" : "Filter",
  "category" : "FILTER",
  "job_type" : "Filter",
  "job_config" : {
    "behavior_frequencys" : [ {
      "behavior_type" : "view",
      "lower_limit" : 1,
      "upper_limit" : 200,
      "time_interval" : 1000
    }, {
      "behavior_type" : "click",
      "lower_limit" : 1,
      "upper_limit" : 200,
      "time_interval" : 1000
    } ],
    "behavior_logic" : "OR",
    "max_recommended_num" : 10
  }
}

LR作业

/v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/instance

{
  "job_name" : "LR-test",
  "description" : "lr",
  "category" : "SORTING",
  "job_type" : "LR",
  "job_config" : {
    "algorithm_specify_parameters" : { },
    "deep_learning_parameters" : {
      "max_iterations" : 50,
      "regular_parameters" : {
        "l2_regularization" : 0,
        "regular_loss_compute_mode" : "full"
      },
      "early_stop_iterations" : 5,
      "initial_parameters" : {
        "initial_method" : "normal",
        "mean_value" : 0,
        "standard_deviation" : 0.001
      },
      "optimize_parameters" : {
        "type" : "adam",
        "learning_rate" : 0.001,
        "epsilon" : 1.0E-8
      }
    }
  }
}

数据探索作业

 /v2.0/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/workspaces/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/resources/testuuidxxxxxxxxxxxxxxxxxxxxxxxx/job-instance

{
  "job_name" : "DataExploration-test",
  "description" : "DataExploration",
  "category" : "DATASOURCE",
  "job_type" : "DataExploration",
  "job_config" : { }
}

响应示例

状态码： 200

{
  "job" : {
    "category" : "RECALL",
    "job_name" : "ae1",
    "job_id" : "388af6ad73cb49dcaf058b929fe2ecbb",
    "description" : "",
    "job_type" : "AlsCF",
    "schedule" : "00 50 01 * * ?",
    "status" : "Draft",
    "next_schedule_time" : 1588873800000,
    "platform" : "OFFLINE",
    "workspace_id" : "06a7c49afc00d4972ff1c0113d8c49ba",
    "resource_id" : "286f3ab20331476b9c731ac32c97236f"
  },
  "is_success" : true
}

状态码

状态码	描述
200	OK

错误码

请参见错误码。

父主题： 训练作业

上一篇：训练作业

下一篇：新建多个训练作业

意见反馈

文档内容是否对您有帮助？

有帮助没帮助

提供反馈

提交成功！非常感谢您的反馈，我们会继续努力做到更好！您可在我的云声建议查看反馈及问题处理状态。

系统繁忙，请稍后重试

在使用文档中是否遇到以下问题

内容与产品页面不一致

内容不易理解

缺失示例代码

步骤不可操作

搜不到想要的内容

缺少最佳实践

意见反馈（选填）

0/500

请至少选择一项反馈信息并填写问题反馈

字符长度不能超过500

直接提交取消

如您有其它疑问，您也可以通过华为云社区问答频道来与我们联系探讨

智能客服提问云社区提问

新建训练作业

功能介绍

调试

URI

请求参数

响应参数

请求示例

响应示例

状态码

错误码

相关文档

意见反馈

文档内容是否对您有帮助？

7*24

备案

专业服务

退订

建议反馈

售前咨询热线