Enable canisters to make HTTP(S) requests

lastmjs · August 3, 2022, 7:38pm

So I’ve been working with http requests in the local replica, I have a question about cycle costs. It seems like you have to send a few hundred billion cycles with each http request call, why is that?

    const http_result: CanisterResult<HttpResponse> = yield ManagementCanister.http_request({
        url: ethereum_url,
        max_response_bytes: null,
        http_method: {
            POST: null
        },
        headers: [],
        body: utf8.fromString(JSON.stringify({
            jsonrpc: '2.0',
            method: 'eth_blockNumber',
            params: [],
            id: 83
        })),
        transform_method_name: 'eth_block_number_transform'
    }).with_cycles(300_000_000_000n); // TODO why is it asking for this many cycles?

dieter.sommer · August 3, 2022, 7:41pm

Hi @lastmjs!
Each HTTP request costs currently 400M cycles flat fee plus cycles per request and response byte. The max response size is taken as a parameter for charging and the default is 2MB which makes it really costly. You must set the max response size to something in the range you expect in order to not be charged insane amounts of cycles.

Hope that helps.

We are currently revising pricing, so the pricing for this feature may change. Currently it is rather conservatively (expensively) priced.

lastmjs · August 3, 2022, 7:52pm

I see, thanks for the clarification. Another quick question, my transform_method doesn’t seem to be necessary locally, requests are returning just fine even if the transform_method doesn’t do anything. Is the local replica in dfx 0.11.0 working like it will in production? I just don’t want to run into any nasty surprises

dieter.sommer · August 3, 2022, 8:05pm

Yes, correct observation.
Transform is there to ensure that when having >1 replicas, their respective responses are made the same. For example, responses often contain timestamps or other items that change between responses. This only becomes an issue in case you have more than 1 replica because then it may lead to no consensus being reached on different responses.
The behaviour between the dfx environment and IC deployment can vary a lot for this feature unfortunately, exactly because on IC mainnet all replicas of the subnet make the request and if the responses do differ in some parts you need a proper transform function to get the same responses and have the response go through consensus. There are some more pitfalls that I am currently writing up in the feature documentation to help folks not waste time things that we already know may cause problems.

Stay tuned, we are very close to finalizing this and releasing the documentation.

What are you working on if I may ask? Some form of Ethereum integration based on cloud nodes by any chance?

lastmjs · August 3, 2022, 8:19pm

What are you working on if I may ask? Some form of Ethereum integration based on cloud nodes by any chance?

Exactly! I’m getting Azle ready for outgoing http requests, and as part of that I’m writing some Ethereum examples pulling data (and hopefully writing data using POST requests) to Ethereum using a Web2 service.

So, I really think that the local replica needs to simulate the http consensus, otherwise it could be extremely difficult to figure out how to properly transform the data. It’s strange that this isn’t simulated, as my understanding is that the local replicas simulate the consensus delay for update calls so that developers aren’t surprised by mainnet. Also dfx 0.10+ has a local cycle environment that more closely resembles production.

Are there plans to get the local replica to work like production? All of the code I’m writing now, I doubt it will work once I deploy it.

dieter.sommer · August 3, 2022, 8:27pm

Currently, the single replica in the dfx environment will behave differently (not causing the problems one may run on IC mainnet) and we currently do not have a plan to change this as this would essentially mean a completely different architecture with multiple replicas in the dfx environment or implementing this “simulation” by hand. What we are planning to do is to provide documentation that mentions the pitfalls we are aware of, either by theory of by having run into them when writing the sample dApp. This should help folks already a lot.

You definitely need to analyze the responses by the service you are making requests to for variable response fields or just extract the data items you are interested in and throw away the remaining parts of the response. Pro tip: Also look at the response headers as they may contain timestamps.

lastmjs · August 3, 2022, 8:34pm

So do I understand correctly that the best way to debug this right now locally is to create a transform function and just log the HttpResponse that is the parameter to that function, looking for non-determinism?

dieter.sommer · August 3, 2022, 8:42pm

I would start by making the same request twice and diffing it to find the variable parts, both in body and headers. Then write the transform function based on the diff you observe. Then wait for the IC mainnet release and test it there. This should get you a solution that works immediately or get you close to that on mainnet if you proceed like this and work thoroughly.

Our own engineers ran into some problems in that area as well when writing example code, so you really need to get used to it and know about the pitfalls. You (and others) should have a good starting point with the information I gave you in the last few posts. But think of it that way: That’s the first time in history that a smart contract can make HTTP requests to Web 2.0 services. And you are one of the first people implementing such a smart contract. We are very much operating at the forefront of technology here.

lastmjs · August 3, 2022, 8:51pm

Let’s say I send a lot of cycles, much more than required based on the response size, will the IC refund me the cycles? Or if I send a lot of cycles will the system just take them all from me?

lastmjs · August 3, 2022, 8:55pm

I have a lot of questions actually, I don’t want to spam this thread. Is this the best place to ask these? I think they will be useful to others as well.

dieter.sommer · August 4, 2022, 6:19am

The system should refund the cycles, it only takes what it costs.

Yes, this is the best place, please ask them. They will also be valuable for the documentation as others will have the same. I will answer as time permits me to.

lastmjs · August 4, 2022, 4:07pm

Will POST requests be supported at launch along with GET requests?

dieter.sommer · August 4, 2022, 7:02pm

Yes, POST support is implemented already. Please note that all replicas will make the same POST call, so there must be some way to prevent it to be made 13 times on the server. The standard solution for this are idempotency keys.

lastmjs · August 4, 2022, 9:47pm

I’m running into some confusing behavior here. If I set max_response_bytes to null and I send 300_000_000_000 cycles with the http request, just to be sure, and my response ends up only being 200 bytes, will I get charged an outrageous amount, or will the system refund me everything that it didn’t need to use? Because in my local testing, if I set max_response_bytes to null and send 300_000_000_000 cycles, even though my response bodies are around 100 bytes, I’m getting charged like 1T cycles for 6 http requests.

I’m a little confused at why we even have to set the max_response_bytes and send cycles with the request. Can’t the IC just charge the canister based on what it used? Why can’t we get rid of these two requirements?

dieter.sommer · August 5, 2022, 6:52am

If you do not specify max_response_size, the system takes the default of 2MB. This results in 2M * 100M cycles to be charged for the response. This is around 200B, which is in line with what you are observing. Always set max_response_bytes to not be charged for the maximum response size for the HTTP request.

The reason we have the max_response_bytes is that it would be technically too much effort to charge what was actually going over the wire in terms of incoming responses (this information would also need to go through consensus). Thus, we decided to introduce the max_response_bytes parameter and always charge the max response size instead of the actual size. Thus it’s important to set the parameter to a value close the real value to not be overcharged.

The default way of charging is to send cycles along, it would be theoretically possible also to directly deduct. We decided to not do that (can’t recall the exact reason, but think it was compliance with the typical way of charging) and thus one has to send cycles. You can always send along the max and the system deducts what it costs and returns the rest, so it should be convenient enough.

skilesare · August 5, 2022, 12:39pm

Is this api available from motoko yet?

lastmjs · August 5, 2022, 1:17pm

It’s just a new method on the management canister, so if you create the types I assume yes.

skilesare · August 5, 2022, 1:37pm

oh…wow. So you have to send a request to the management canister to have an HTTP request made? Is that the final design? It is literally just called http_request on the management canister with the signature above?

dieter.sommer · August 5, 2022, 2:14pm

Yes!
Depending on the response behaviour of the server, you need to write a transformation function that makes sure that the transformed responses of all replicas are equal.

The interface specification of the method http_request is available here already:

github.com

dfinity/interface-spec/blob/master/spec/index.adoc#ic-method-http_request

= The Internet Computer Interface Specification
DFINITY Foundation
∞
// the following are ways to make this document work both in antora and stand alone
:example: example$
:partial: partial$

WARNING: You are looking at the `master` version of the document! If you are looking for implementation specification or documentation, look at one of the versions at https://docs.dfinity.systems/public/v/.

== Introduction

Welcome to _the Internet Computer_! We speak of “the” Internet Computer, because although under the hood a large number of physical computers are working together in a blockchain protocol, in the end we have the appearance of a single, shared, secure and world-wide accessible computer. Developers who want to build decentralized applications (or _dapps_ for short) that run on the Internet Computer blockchain and end-users who want to use those dapps need to know very little, if anything, about the underlying protocol. However, knowing some details about the interfaces that the Internet Computer exposes can allow interested developers and architects to take fuller advantages of the unique features that the Internet Computer provides.

=== Target audience

This document describes this _external_ view of the Internet Computer, i.e. the low-level interfaces it provides to dapp developers and users, and what will happen when they use these interfaces.

NOTE: While this document describes the external interface and behavior of the Internet Computer, it is not intended as end-user or end-developer documentation. Most developers will interact with the Internet Computer through additional tooling like the SDK, Canister Development Kits and Motoko. Please see https://sdk.dfinity.org/ for suitable documentation.

The target audience of this document are

This file has been truncated. show original

claudio · August 5, 2022, 2:48pm

I think the link to the relevant Candid interface is broken in that documents but can be found here:

github.com

dfinity/interface-spec/blob/master/spec/ic.did

type canister_id = principal;
type user_id = principal;
type wasm_module = blob;

type canister_settings = record {
  controllers : opt vec principal;
  compute_allocation : opt nat;
  memory_allocation : opt nat;
  freezing_threshold : opt nat;
};

type definite_canister_settings = record {
  controllers : vec principal;
  compute_allocation : nat;
  memory_allocation : nat;
  freezing_threshold : nat;
};

type http_header = record { name: text; value: text };

This file has been truncated. show original

I believe this should be useable from Motoko already with some selective importing of IC ManagementCanister methods, but it would be nice if someone could translate the Rust example Motoko.

Rust example appears to live here:

It would also make sense to wrap the functionality into a little Motoko library, as we do for IC randomness…

Topic		Replies	Views
Canister http request? Developers	1	562	June 25, 2022
Why will the IC be able to call external APIs while other blockchains can't do that? General	5	977	January 30, 2022
Make external (non-IC) HTTP request from backend canister? Programs & Applications	5	4711	May 7, 2021
Registering Web2 Webhooks Motoko	1	53	September 30, 2024
Non replicated HTTPS outcalls Developers	20	1378	August 30, 2024

Enable canisters to make HTTP(S) requests

Related topics