Vertex AI model batch prediction: problems referencing an existing model and an input file on Cloud Storage

2024-03-07

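I'm struggling to correctly set up a Vertex AI pipeline that does the following:

1. Reads data from an API, stores it in GCS, and uses it as input for batch prediction.
2. Gets an existing model (video classification on Vertex AI).
3. Creates a batch prediction job with the input from point 1.

As you will see, I don't have much experience with Vertex Pipelines/Kubeflow, so I'm asking for help/advice and hoping this is just a beginner mistake. This is the gist of the code I'm using as the pipeline: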

from google_cloud_pipeline_components import aiplatform as gcc_aip
from kfp.v2 import dsl

from kfp.v2.dsl import component
from kfp.v2.dsl import (
    Output,
    Artifact,
    Model,
)

PROJECT_ID = 'my-gcp-project'
BUCKET_NAME = "mybucket"
PIPELINE_ROOT = "{}/pipeline_root".format(BUCKET_NAME)


@component
def get_input_data() -> str:
    # getting data from API, save to Cloud Storage
    # return GS URI
    gcs_batch_input_path = 'gs://somebucket/file'
    return gcs_batch_input_path


@component(
    base_image="python:3.9",
    packages_to_install=['google-cloud-aiplatform==1.8.0']
)
def load_ml_model(project_id: str, model: Output[Artifact]):
    """Load existing Vertex model"""
    import google.cloud.aiplatform as aip

    model_id = '1234'
    model = aip.Model(model_name=model_id, project=project_id, location='us-central1')



@dsl.pipeline(
    name="batch-pipeline", pipeline_root=PIPELINE_ROOT,
)
def pipeline(gcp_project: str):
    input_data = get_input_data()
    ml_model = load_ml_model(gcp_project)

    gcc_aip.ModelBatchPredictOp(
        project=PROJECT_ID,
        job_display_name=f'test-prediction',
        model=ml_model.output,
        gcs_source_uris=[input_data.output],  # this doesn't work
        # gcs_source_uris=['gs://mybucket/output/'],  # hardcoded gs uri works
        gcs_destination_output_uri_prefix=f'gs://{PIPELINE_ROOT}/prediction_output/'
    )


if __name__ == '__main__':
    from kfp.v2 import compiler
    import google.cloud.aiplatform as aip
    pipeline_export_filepath = 'test-pipeline.json'
    compiler.Compiler().compile(pipeline_func=pipeline,
                                package_path=pipeline_export_filepath)
    # pipeline_params = {
    #     'gcp_project': PROJECT_ID,
    # }
    # job = aip.PipelineJob(
    #     display_name='test-pipeline',
    #     template_path=pipeline_export_filepath,
    #     pipeline_root=f'gs://{PIPELINE_ROOT}',
    #     project=PROJECT_ID,
    #     parameter_values=pipeline_params,
    # )

    # job.run()

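When I run the pipeline, it throws this exception while running the batch prediction:
details = "List of found errors: 1.Field: batch_prediction_job.model; Message: Invalid Model resource name.
So I'm not sure what could be wrong. I tried loading the model in a notebook (outside of a component) and it returns correctly.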

The second problem I'm having is referencing the GCS URI that a component outputs as the input to the batch job.

   input_data = get_input_data2()
   gcc_aip.ModelBatchPredictOp(
        project=PROJECT_ID,
        job_display_name=f'test-prediction',
        model=ml_model.output,
        gcs_source_uris=[input_data.output],  # this doesn't work
        # gcs_source_uris=['gs://mybucket/output/'],  # hardcoded gs uri works
        gcs_destination_output_uri_prefix=f'gs://{PIPELINE_ROOT}/prediction_output/'
    )

During compilation I get the following exception: TypeError: Object of type PipelineParam is not JSON serializable, though I think this could be an issue with the ModelBatchPredictOp component.

Again, any help/advice is appreciated. I've been dealing with this since yesterday, so maybe I missed something obvious.

Libraries I'm using:

google-cloud-aiplatform==1.8.0  
google-cloud-pipeline-components==0.2.0  
kfp==1.8.10  
kfp-pipeline-spec==0.1.13  
kfp-server-api==1.7.1

UPDATE: After comments, some research, and tuning, this works for referencing the model:

@component
def load_ml_model(project_id: str, model: Output[Artifact]):
    region = 'us-central1'
    model_id = '1234'
    model_uid = f'projects/{project_id}/locations/{region}/models/{model_id}'
    model.uri = model_uid
    model.metadata['resourceName'] = model_uid

And then I can use it as intended:

batch_predict_op = gcc_aip.ModelBatchPredictOp(
        project=gcp_project,
        job_display_name=f'batch-prediction-test',
        model=ml_model.outputs['model'],
        gcs_source_uris=[input_batch_gcs_path],
        gcs_destination_output_uri_prefix=f'gs://{BUCKET_NAME}/prediction_output/test'
    )
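
The "Invalid Model resource name" error is consistent with the batch prediction job receiving something other than a fully qualified resource name, which is what the updated component writes into the artifact. As a minimal sketch of that naming format: the helper name `model_resource_name` and the regular expression are hypothetical, and the check is a simplified illustration, not the validation the service itself performs.

```python
import re

# Simplified pattern for projects/<project>/locations/<region>/models/<id>.
# This is an illustrative check, not the service's own validation rule.
_MODEL_NAME_RE = re.compile(r"^projects/[^/]+/locations/[^/]+/models/[^/]+$")


def model_resource_name(project_id: str, region: str, model_id: str) -> str:
    """Build a fully qualified Vertex AI model resource name."""
    name = f"projects/{project_id}/locations/{region}/models/{model_id}"
    if not _MODEL_NAME_RE.match(name):
        raise ValueError(f"not a valid model resource name: {name!r}")
    return name


print(model_resource_name("my-gcp-project", "us-central1", "1234"))
```

A bare ID such as '1234' does not have this shape, which may be why the job rejected the model before the update.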

UPDATE 2: For the GCS path, a workaround is to define the path outside of the component and pass it as an input parameter, for example (abbreviated):

@dsl.pipeline(
    name="my-pipeline",
    pipeline_root=PIPELINE_ROOT,
)
def pipeline(
        gcp_project: str,
        region: str,
        bucket: str
):
    ts = datetime.datetime.now().strftime("%Y%m%d-%H%M%S")
    
    gcs_prediction_input_path = f'gs://{BUCKET_NAME}/prediction_input/video_batch_prediction_input_{ts}.jsonl'
    batch_input_data_op = get_input_data(gcs_prediction_input_path)  # this loads input data to GCS path

    batch_predict_op = gcc_aip.ModelBatchPredictOp(
        project=gcp_project,
        model=training_job_run_op.outputs["model"],
        job_display_name='batch-prediction',
        # gcs_source_uris=[batch_input_data_op.output],
        gcs_source_uris=[gcs_prediction_input_path],
        gcs_destination_output_uri_prefix=f'gs://{BUCKET_NAME}/prediction_output/',
    ).after(batch_input_data_op)  # 'after' is needed so this runs once the input data is prepared, since get_input_data doesn't return anything
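
One caveat with this workaround: the body of the pipeline function runs when the pipeline is compiled, so the timestamp is fixed at compile time and every run of the same compiled JSON reuses the same path. A small sketch of the path construction with the time passed in explicitly, which makes that dependency visible; the function name `prediction_input_path` is hypothetical, and the file name pattern is copied from the snippet above.

```python
import datetime


def prediction_input_path(bucket: str, now: datetime.datetime) -> str:
    """Build the timestamped GCS path for the batch prediction input file."""
    ts = now.strftime("%Y%m%d-%H%M%S")
    return f"gs://{bucket}/prediction_input/video_batch_prediction_input_{ts}.jsonl"


# A fixed time is used here so the result is reproducible.
fixed = datetime.datetime(2024, 3, 7, 12, 30, 0)
print(prediction_input_path("mybucket", fixed))
```

If a fresh path is needed for each run, one option is to compute it when the job is submitted and pass it in as a pipeline parameter.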

I'm still not sure why it doesn't work/compile when I return the GCS path from the get_input_data component.

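I'm glad you solved most of your main issues and found a workaround for the model declaration.

As for your input.output observation on gcs_source_uris, the reason behind it is the way the function/class returns its value. If you dig into the classes/methods of google_cloud_pipeline_components, you will find that it implements a structure that lets you use .outputs on the value returned by the called function.

If you go to the implementation of one of the pipeline's components, you will find that it returns an output array from the convert_method_to_component function. So, to get the same behavior in your custom class/function, your function should return a value that can be accessed as an attribute. Below is a basic implementation of it.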

class CustomClass:
    def __init__(self):
        self.return_val = {'path': 'custompath', 'desc': 'a desc'}

    @property
    def output(self):
        return self.return_val


hello = CustomClass()
print(hello.output['path'])
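
The class above also hints at why the compile step may fail: at compile time a component's output is a placeholder object, not the string it will hold at run time. The sketch below assumes the wrapper serializes list arguments to JSON during compilation, which has not been checked against the library source here. `OutputPlaceholder` and `try_serialize` are hypothetical stand-ins and are not part of kfp.

```python
import json


class OutputPlaceholder:
    """Hypothetical stand-in for what a component call returns at compile time."""

    def __init__(self, name: str):
        self.name = name


def try_serialize(value) -> str:
    """Return the JSON text, or the error message if serialization fails."""
    try:
        return json.dumps(value)
    except TypeError as exc:
        return f"TypeError: {exc}"


# A list of plain strings serializes without trouble.
print(try_serialize(["gs://mybucket/output/"]))
# A list holding a placeholder object does not.
print(try_serialize([OutputPlaceholder("get-input-data-output")]))
```

Under that assumption, a hardcoded URI compiles because it is already a string, while a list wrapped around a component output is not.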

If you want to dig deeper into it, you can go to the following pages:

convert_method_to_component https://github.com/bharathdsce/kubeflow/blob/fcd627714664956b2c280b0109b64633bc99fa05/components/google-cloud/google_cloud_pipeline_components/aiplatform/utils.py#L383, which is the implementation of convert_method_to_component.
Properties https://www.programiz.com/python-programming/property, the basics of properties in Python.
