PPC905AE101 3BHE014070R0101 Troubleshooting of DCS in the production process



By
jonson
25 1 月 24
0
comment

Each manufacturer’s DCS has its own characteristics, so the analysis and handling of its fault phenomena are not the same. However, in summary, the obstacles caused by DCS to the unit can be divided into three categories:


(1) System issues, including design and installation defects, software and hardware failures, etc.
(2) Malfunctions caused by human factors, including personnel errors, inadequate management systems, and implementation of execution processes.
(3) DCS malfunction caused by external environmental issues in the system. Abnormal conditions are caused by factors such as high or low ambient temperature, humidity, dust, vibration, and small animals.
2.1 Examples of problems and malfunctions with DCS itself
This type of malfunction is more common in the production process, mainly including system design and installation defects, controller (DPU or CPU) crashes, network disconnection and other faults, operator station black screen, network communication blockage, software defects, low system configuration, and interface problems with other systems and equipment.
2.1.1 Power and grounding issues:
(1) The DCS power supply system of a certain power plant uses ABB Symphony III power supply, but during infrastructure construction, the cabinet installation is still carried out according to the grounding method of Type II power supply, which differs greatly from the grounding technology requirements of Type III power supply. Since the unit was put into operation, there have been multiple DCS module failures, signal jumps, and hardware burns, which are suspected to be related to the grounding system. Similarly, during the construction period of a certain power plant, there were problems with the design, production, and installation of the DCS grounding network. After the DCS system was operated, all temperature measurement points of the thermal resistance thermocouples experienced periodic fluctuations.
(2) A certain factory’s control system on the turbine side failed due to loose power connections.
Lesson learned: DCS does not have a good grounding system and reasonable cable shielding, which not only causes significant system interference, but also makes the control system prone to false signals and module damage. It can be seen that problems with UPS power supply and control system grounding will pose great risks to the safe and stable operation of DCS after the power plant is put into operation. Therefore, the power supply design of DCS system must have reliable backup means, and the load configuration should be reasonable and have a certain margin; The grounding of the DCS system must strictly comply with the technical requirements of the manufacturing plant (if there are no special instructions from the manufacturing plant, DLT774 regulations should be followed). All cables entering the DSC system control signal must use qualified shielded cables and be laid separately from the power cables with good single end grounding.
2.1.2 System configuration issues:
(1) Frequent failures and crashes of the DCS (T-ME/XP system) in a power plant in Zhejiang caused a unit shutdown accident. 7. 8 units (2 * 330MW) experienced a total of 22 DCS system failures and freezes from February 1997 to May, resulting in 8 abnormal trips of the units. Afterwards, there were multiple malfunctions in the operation screen (two times in Unit 8, all six operation stations experienced “black screens”), seriously threatening the safety of the unit. After analysis, it is believed that the DCS system has the following problems: (1) There are problems in the performance calculation software and switch redundancy configuration in the DCS engineering design. (2) Hardware configuration mismatch (including matching and communication issues between T-ME and T-XP systems). (3) Some hardware designs are incomplete. (4) Further analysis reveals that the critical CS275 (lower level T-ME) communication bus has a bottleneck problem due to its high load rate. Under the premise of reasonable configuration, European T-ME/XP system users have generally good usage of the T-ME/XP system.
(2) The DCS used in the automation transformation of the thermal control system of a 200MW unit in a certain power plant was found to have a load rate exceeding 90% and a soft manual operation response close to 1 minute, which was impossible to use due to inaccurate calculation of the load rate of the system configuration and the fact that the technical indicators were close to the allowable limit in order to reduce investment. Additionally, the system had a large number of virtual I/O points during operation, After significant adjustments (system re configuration), this issue was resolved.
(3) A 600MW unit in Northeast China had a low configuration made by the DCS manufacturer due to inadequate description of the isolation properties of the I/O channels in the bidding technical specifications. As a result, a large number of I/O boards were burned during debugging. Later, the isolation method and hardware were changed, and the power plant spent a lot of money, which also offset the original bidding price advantage. In addition, the quality and shielding issues of cables must also be highly valued, and computer specific shielded cables should be used for important signals and control. Many renovation projects have to re lay cables due to cable problems, which affects the construction period.
(4) The Xinhua XDPS-400 system engineer station of a 300MW unit in a certain power plant frequently crashes. After inspection, it was found that there are many operating programs, including multiple virtual DPUs, historical data records, performance calculations, reports, etc. Resolving the issue of allocating historical data to other human-machine interface stations.
2.1.3 Controller (DPU or CPU) malfunction
(1) The CPU of the FSSS1 control system in the HIACS-5000CM control system of a 300MW # 2 unit in a certain power plant failed, and the control right was not handed over. The CPU failed to switch to the main control, resulting in the inability of the control equipment in this part of the system to operate (the equipment remained in its original state of operation). When performing online replacement steps on the main CPU until power outage, switching from the CUP to the main control CPU, the system equipment is controlled, and everything is normal after replacing the original main control CPU.
(2) The problem of inconsistent data communication between different controllers in the same PCU cabinet of SYMPHONY produced by ABB at a certain time in the early stage was solved by upgrading the firmware;
(3) In the early stage of Xinhua Control XDPS system, a certain batch of DPUs had multiple offline and crash phenomena. After inspection, it was found that there were individual capacitor issues with the DPU card components. After upgrading and replacing the card components, the problem was resolved.
Due to the current redundant configuration of DCS controllers, the number of unit trips caused by abnormal operation of the main controller is greatly reduced. However, once a pair of redundant controllers crash simultaneously, it will directly threaten safety production. Measures must be taken to effectively avoid such situations.

发表回复