boaviztapi

Providing data for Azure virtual machines

Open bpetit opened this issue 2 years ago • 6 comments

Hi there,

Here is a PR aggregating data to feed the API with Microsoft Azure data for virtual machines (and their presumed underlying hosts).

This should not be merged until we have at least cleaned it up (for instance, our scripts are not PEP8-compliant at all 😋), improved the README, and had it reviewed, of course. Still, we wanted to let contributors review it in advance and give feedback.

I hope it is not too much noise until it is clean.

Kudos to @bdromard, and a ping to join the discussion :)

bpetit • Apr 04 '24 15:04

We tried to assess how coherent the data is by comparing it to AWS instances, and we are analysing how much the different input data and hypotheses influence the result.

So far it seems coherent, so if anyone wants to review the PR, that would be great :)

You can find results we generated for testing in results.csv and the script that generated those data in generate_impact_azure.
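One possible way to run such a coherence check is sketched below. It is only an illustration, not necessarily what generate_impact_azure does: the function names, the 50% tolerance, and the pairing of Azure and AWS instances are all assumptions, and the numbers are placeholders that are not taken from results.csv.

```python
def relative_gap(azure_value, aws_value):
    """Relative difference of the Azure value with respect to the AWS one."""
    return (azure_value - aws_value) / aws_value


def flag_outliers(pairs, tolerance=0.5):
    """Return the labels of pairs whose relative gap exceeds the tolerance.

    `pairs` maps a label to an (azure_value, aws_value) tuple holding the
    same impact indicator for two instances considered comparable.
    """
    return [
        label
        for label, (azure_value, aws_value) in pairs.items()
        if abs(relative_gap(azure_value, aws_value)) > tolerance
    ]


# Placeholder numbers for illustration only, NOT taken from results.csv.
pairs = {
    "pair-a": (110.0, 100.0),  # Azure value 10% above the AWS value
    "pair-b": (200.0, 100.0),  # Azure value 100% above the AWS value
}

print(flag_outliers(pairs))
```

Pairs flagged this way are only candidates for a closer look; a large gap may simply reflect a real hardware difference between the two providers.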

Let us know if there is anything more we can do to make the review easier.

Cheers

bpetit • Apr 08 '24 16:04

I tried to explain in a bit more detail the scope covered by the API's cloud/instance route here. I'd be happy to have your feedback on this :)

bpetit • Apr 16 '24 17:04

Hi @da-ekchajzer @demeringo @samuelrince,

The data generated in result.csv seems coherent and is not too far from the data generated for AWS instances with the same script.

I think we could proceed with a global review any time you want now.

The last thing I may do, possibly in another PR, is to include data for the instances that are still missing, especially the A and B series.

As said in the README, we thank GreenPixie for sponsoring this PR.

bpetit • Apr 17 '24 14:04

I tried to explain in a bit more detail the scope covered by the API's cloud/instance route here. I'd be happy to have your feedback on this :)

Great, particularly the diagram 👍

demeringo • May 24 '24 08:05

@bpetit I have looked at and tried the PR, and I might have found a misconfiguration for 'id=Dadsv5-Type1' in server.csv. In this PR this platform is defined with 112 vCPU and an SSD of 2400, but according to the Microsoft page it should be 96 and 3600.

Am I right or do I misunderstand the configuration?
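A minimal sketch of this kind of check is shown below, for anyone who wants to repeat it on other rows. The column names `vcpu` and `ssd_capacity` are hypothetical and may not match the real server.csv header. The values are the ones quoted in this comment.

```python
import csv
import io

# Hypothetical excerpt of server.csv: the column names are assumptions,
# the values are the ones quoted in the comment above.
SERVER_CSV = """id,vcpu,ssd_capacity
Dadsv5-Type1,112,2400
"""

# Values expected according to the Microsoft page mentioned above.
EXPECTED = {"Dadsv5-Type1": {"vcpu": 96, "ssd_capacity": 3600}}


def find_mismatches(csv_text, expected):
    """Return (id, field, found, expected) tuples for every differing field."""
    mismatches = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Rows without an entry in `expected` are skipped.
        for field, want in expected.get(row["id"], {}).items():
            found = int(row[field])
            if found != want:
                mismatches.append((row["id"], field, found, want))
    return mismatches


for mismatch in find_mismatches(SERVER_CSV, EXPECTED):
    print(mismatch)
```

A mismatch reported this way does not by itself mean the CSV is wrong: the two sources may describe different things, for example a host versus a virtual machine size.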

havontuur • Aug 05 '24 13:08

@bpetit I have looked at and tried the PR, and I might have found a misconfiguration for 'id=Dadsv5-Type1' in server.csv. In this PR this platform is defined with 112 vCPU and an SSD of 2400, but according to the Microsoft page it should be 96 and 3600.

Am I right or do I misunderstand the configuration?

Hello @havontuur!

What is indicated for Dadsv5-Type1 in server.csv (or for any other Azure ID in that table) is the configuration of the dedicated host identified by this name, as documented on this page for the vCPU and RAM specifications.

On the issue of storage, the allocation remains a problem, as we have not yet found reliable information on the hardware devices present on dedicated hosts.

Were you referring to this page from the Azure documentation on virtual machines when you mentioned 96 vCPU and 3600 GiB of SSD storage?

bdromard • Aug 22 '24 15:08