New Bridges and Automatic Helm Upgrade i... United Manufacturing Hub #general

New Bridges and Automatic Helm Upgrade i...

learn.umh.app

10/28/2024, 1:51 PM

Hi, we just released a new article on learn.umh.app. Go check it out, and discuss it here! Title: New Bridges and Automatic Helm Upgrade in UMH Excerpt: UMH introduces Benthos-based data bridges and automated Helm upgrades, enhancing reliability, maintainability and user experience. Explore how these updates simplify maintenance, optimize data flows, and enable custom data contracts. Link: https://learn.umh.app/course/new-bridges-and-automatic-helm-upgrade-in-umh/

Jermuk

10/28/2024, 1:52 PM

Must-Read for everyone upgrading the UMH, @Diederik , @trentc @Brian Pribe @DanielH

DanielH

10/28/2024, 3:51 PM

i was actually starting this morning with a bash script for helm upgrades, to automate it more with comparison of user changed values, current values against new chart version etc, to always have a clean chart.. How is this handled in the automatic upgrade?

Jermuk

10/28/2024, 3:57 PM

we decreased the amount of options in the chart, and will over time extend it again. the user values will be taken to the next version automatically from this version on

Jermuk

10/28/2024, 3:57 PM

maybe @Ferdinand has more infos

Ferdinand

10/28/2024, 4:22 PM

Currently we prefer user values over our values (basically --reuse-values).

trentc

10/28/2024, 4:50 PM

Did the upgrade, is there a reason why the RedPanda version "flickers"? It shows this message about the version, then it goes green to latest ver

Jermuk

10/28/2024, 4:51 PM

I am having the same on my instance, @Ferdinand @Athavan kanapuli :/

Jermuk

10/28/2024, 4:51 PM

(also factoryinsight is flickering)

trentc

10/28/2024, 4:52 PM

trentc

10/28/2024, 4:53 PM

quick way to revert?

Linear

10/28/2024, 4:59 PM

Issue [ENG-1833](https://linear.app/united-manufacturing-hub/issue/ENG-1833/redpanda-flickers-with-060) created.

Jermuk

10/28/2024, 4:59 PM

is this coming in the new version? can you do a top / htop on your instance to show us what is taking the most CPU?

trentc

10/28/2024, 5:00 PM

Yes in a few....i have a call

Jermuk

10/28/2024, 5:01 PM

i'll be leaving for sports in a couple of minutes, i dont know about @Ferdinand

Ferdinand

10/28/2024, 5:01 PM

Ill stay for a while

Diederik

10/28/2024, 5:38 PM

Thanks for the info! I will analyse this decently before updating, seems important on first sight!

Diederik

10/28/2024, 5:38 PM

*at first sight

Jermuk

10/28/2024, 5:50 PM

I know what is happening! The new bridges have new consumer groups. So they will start at the oldest message available in Kafka and go through all messages, e.g. send all messages again to MQTT and try to insert it into the database. Until it catches up, there is some high CPU usage

Jermuk

10/28/2024, 5:50 PM

Should have gone with oldest disabled @Ferdinand

Jermuk

10/28/2024, 5:50 PM

To prevent this behaviour

trentc

10/28/2024, 5:51 PM

Does that mean we will have duplicates in the tag table?

Ferdinand

10/28/2024, 5:51 PM

no, tag table automatically filters out duplicates before insertio

Ferdinand

10/28/2024, 5:51 PM

Jermuk

10/28/2024, 5:54 PM

It depending on your MQTT consumer logic in Node-RED, it might recreate a lot of messages causing duplicated messages. Maybe what we should do is hot fix the current helm version to disable the “from oldest” and provide a new version with that. So current users on the helm chart can upgrade, but people upgrading with larger instances don’t have this CPU issue

Jermuk

10/28/2024, 6:10 PM

@Athavan kanapuli @Ferdinand let’s do this tomorrow morning first thing

Diederik

10/28/2024, 10:47 PM

i will update my testsetup tomorrow around noon, feedback by tomorrow evening! If i see no weird things i will update my production setup

Jermuk

10/29/2024, 8:57 AM

maybe dont update yet, we need to add a couple of more fixes for people extensively using MQTT

Jermuk

10/29/2024, 8:59 AM

an update can currently put unnecessary stress on the overall system. it will not result in duplicated tags in the database, but self=created Node-RED flows might not be able to handle it properly. Also the additional stress on the instance could cause it to fail if it is already running on its limits (e.g., if you have already high disk pressure, upgrading could result in tempeorarily needing more storage space) therefore, we are currently fixing this "unnecessary stress" part

Diederik

10/29/2024, 9:05 AM

ok, let me know

DanielH

10/29/2024, 6:38 PM

ok. so this will still mean that if the chart changes you will have old keys that does not exist in the new chart right? i have spent quite some time now to create a bash script that let you choose namespace, chart and version to upgrade and then it downloads the chart temporary to get the default values, compare with the current values from the chart and extracts them to a new yaml file. this way it is easy to filter out what values you have changed. Next i will compare these values against the new chart version to see if the keys exists in the new chart.. It was not so easy to do with only bash scripting but i want it to be simple so it can run direct in the cli on flatcar.. i was also thinking about creating a simple pod running npm and have a small dashboard to handle this with the kubernetes api..

Jermuk

10/31/2024, 3:09 PM

i think this behaviour might be overwritten by our current logic, where you can specify the helm chart version in the mgmtcompanion-config, and the mgmtcompanion updates it then.

Jermuk

10/31/2024, 3:12 PM

Also in general we fixed the "unnecessary stress " bug. What can still happen is that if you have custom grafana settings (like changed the grafana.ini), custom Node-RED settings (like username/password) or topic retention in redpanda, these settings need to be re-applied after an update. also what I can offer you is that we can agree on a time frame next week where if you update and have any issues, we can quickly setup a meeting and help you. tomorrow is public holiday here in our region, so we could schedule slots from monday on.

DanielH

10/31/2024, 4:33 PM

ok. well, i am not worried about the update, currently my new setup is empty and i have vacation this week since the kids have Autumn leave.. What i mean is if you use the --reuse in helm it will put in all values from the current setup regardless of what is in the new chart. From my understanding if reading both the manual and forum threads it is better to reset the values and ensure you only apply valid values for the new chart. This is why i making a quite large bash script that reads out the changed values in the current chart, saves them to a file and check against the new chart version if the keys exists. This way you will see what values you have changed and also if some setting has been removed.

DanielH

10/31/2024, 4:35 PM

for custom settings there will be no problem since i will setup a completely new stack and move assets one by one to the new and lastly move the old data and convert it to the new model.

Previous Next