**Guest** · July 22, 2019, 09:20 AM

Just a side note: When I am controlling numerous zwave devices, I will place a Wait command after about every 6 device commands. This allows Homeseer and ZWave to "catch up". I experimented to find how often to put wait statements in, and for how long to wait.
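As a rough illustration of that pattern, here is a minimal Python sketch. It is not HomeSeer code: `send_in_batches`, the `send` callable, and the default pause length are all hypothetical stand-ins, and the right batch size and pause have to be found by experiment, as described above.

```python
import time
from typing import Callable, Iterable


def send_in_batches(
    commands: Iterable,
    send: Callable[[object], None],
    batch_size: int = 6,          # "about every 6 device commands"
    pause_seconds: float = 1.0,   # assumed value; tune by experiment
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Send each command, pausing after every full batch so the
    controller can catch up. Returns the number of commands sent."""
    sent = 0
    for command in commands:
        # Pause before starting a new batch, never after the last command.
        if sent and sent % batch_size == 0:
            sleep(pause_seconds)
        send(command)  # hypothetical: whatever actually issues the command
        sent += 1
    return sent
```

Passing `sleep` as a parameter is only there so the pause can be swapped out when testing; in real use the default `time.sleep` applies.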

**Wade** · July 22, 2019, 09:26 AM

This would be an excellent addition to HS processing. I also see these types of errors regularly, say, several times per day. They occur on my system not only when processing large groups of devices but also with individual commands. I don't know the reason, but sometimes Z-Wave failure errors are logged even though the device does in fact receive and process the command, albeit belatedly. A timing issue, perhaps? Or is the system in fact logging an error and then retrying?

As for problems with large groups of devices, I experienced it consistently back when trying to use the All Off z-wave command--to the point of the command being unusable. (I think I've read the All On / All Off functions may have been or will be deprecated?) I then moved to EasyTrigger groups but still have the issue of devices not receiving commands unless I use the more recent ET feature of inserting pauses between each device command--minimum 1 second. This results in successfully sending all commands perhaps 95% of the time, but in many cases the time it takes to cycle through a large number of devices is undesirable.

rjh, I also would appreciate your consideration of jvm's thoughtful suggestion.

**jvm** · July 22, 2019, 09:27 AM

Originally posted by aa6vh

Just a side note: When I am controlling numerous zwave devices, I will place a Wait command after about every 6 device commands. This allows Homeseer and ZWave to "catch up". I experimented to find how often to put wait statements in, and for how long to wait.

Thanks for the suggestion. Yes, that's one way to deal with it, but it highlights the problem: failure recovery is pushed to the user as ad-hoc solutions, which can slow the overall speed of the system (all those waits add up if you have 100 devices to command), rather than being handled in the plugin when possible. It seems to me the best "design philosophy" is to handle all predictable failure scenarios (and this is one) in the plugin rather than expecting each end-user to detect the error, diagnose why, and figure out an ad-hoc solution to address it.
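To make the idea concrete, here is a minimal Python sketch of recovery handled in one place rather than by fixed waits in every event. It is not the Z-Wave plugin's actual logic: `send_with_retry`, the `send` callable (assumed to return `True` on success), and the attempt and delay values are all hypothetical.

```python
import time
from typing import Callable, Iterable, List


def send_with_retry(
    device_ids: Iterable[int],
    send: Callable[[int], bool],   # hypothetical: returns True on success
    max_attempts: int = 3,         # assumed limit
    retry_delay: float = 0.5,      # assumed base delay in seconds
    sleep: Callable[[float], None] = time.sleep,
) -> List[int]:
    """Command every device once, then retry only those that failed.
    Returns the devices still failing after the last attempt."""
    pending = list(device_ids)
    for attempt in range(1, max_attempts + 1):
        # Only devices that failed the previous pass are tried again.
        failed = [device for device in pending if not send(device)]
        if not failed:
            return []
        pending = failed
        if attempt < max_attempts:
            # Wait a little longer before each successive retry pass.
            sleep(retry_delay * attempt)
    return pending
```

The point of the design is that a delay is paid only when something actually fails, so a run where all 100 devices respond the first time incurs no waits at all, and the caller gets back a list of devices that still need attention.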

**ericg** · July 22, 2019, 11:42 AM

+1, jvm. Excellent post. There seems to be ongoing contention for HS3/4 development resources. One camp (largely driven by marketing considerations, I suspect) is focussed on mobile app feature enhancements, while another group (of which I am a member) is much more interested in rock solid engine performance. To me, an occasional failure to execute an event, or script, or device command is simply intolerable. And anything less than 100% logging (if desired) of hardware and detected system failures compounds the injury. When it comes to the engine, "mostly" is simply not good enough.

A different but somewhat related request I would like to add is improved system performance monitoring. In the current context, the system should collect data on device command failure rate (by device and in aggregate), average and maximum times for command completion, etc. Data values would be assigned to virtual devices, which could then drive user-written events. Implemented properly, performance monitoring functionality would help identify incipient hardware failure before it becomes catastrophic.
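A minimal Python sketch of the kind of bookkeeping I have in mind follows. It is purely illustrative: `CommandStats` and `CommandMonitor` are hypothetical names, nothing here is an existing HomeSeer feature, and pushing the values into virtual devices is left out.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class CommandStats:
    attempts: int = 0
    failures: int = 0
    total_seconds: float = 0.0
    max_seconds: float = 0.0

    @property
    def failure_rate(self) -> float:
        return self.failures / self.attempts if self.attempts else 0.0

    @property
    def average_seconds(self) -> float:
        return self.total_seconds / self.attempts if self.attempts else 0.0


class CommandMonitor:
    """Tracks command outcomes per device and in aggregate."""

    def __init__(self) -> None:
        self.by_device = defaultdict(CommandStats)
        self.aggregate = CommandStats()

    def record(self, device_id: int, succeeded: bool, seconds: float) -> None:
        # Update both the per-device counters and the system-wide totals.
        for stats in (self.by_device[device_id], self.aggregate):
            stats.attempts += 1
            if not succeeded:
                stats.failures += 1
            stats.total_seconds += seconds
            stats.max_seconds = max(stats.max_seconds, seconds)
```

A user-written event could then trigger when, for example, one device's `failure_rate` or `max_seconds` drifts well above the aggregate figure.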

Command Failure When Controlling Large Number of Devices