U.S. patent application number 17/260069 was published by the patent office on 2021-09-02 for transport object specifying device of work machine, work machine, transport object specifying method of work machine, method for producing complementary model, and dataset for learning.
The applicant listed for this patent is KOMATSU LTD. The invention is credited to Shintaro HAMADA, Yosuke KAJIHARA, and Shun KAWAMOTO.
United States Patent Application 20210272315
Kind Code: A1
KAWAMOTO; Shun; et al.
Published: September 2, 2021
Application Number: 17/260069
Family ID: 1000005638710
TRANSPORT OBJECT SPECIFYING DEVICE OF WORK MACHINE, WORK MACHINE,
TRANSPORT OBJECT SPECIFYING METHOD OF WORK MACHINE, METHOD FOR
PRODUCING COMPLEMENTARY MODEL, AND DATASET FOR LEARNING
Abstract
A transport object specifying device of a work machine includes
an image acquisition unit, a drop target specifying unit, a
three-dimensional data generation unit, and a surface specifying
unit. The image acquisition unit acquires a captured image showing
a drop target of the work machine in which a transport object is
dropped. The drop target specifying unit specifies a
three-dimensional position of at least part of the drop target
based on the captured image. The three-dimensional data generation
unit generates depth data, which is three-dimensional data
representing a depth of the captured image, based on the captured
image. The surface specifying unit specifies a three-dimensional
position of a surface of the transport object in the drop target by
removing, from the depth data, a part corresponding to the drop
target based on the three-dimensional position of the at least part
of the drop target.
Inventors: | KAWAMOTO; Shun; (Tokyo, JP); HAMADA; Shintaro; (Tokyo, JP); KAJIHARA; Yosuke; (Tokyo, JP) |
Applicant: | KOMATSU LTD.; Tokyo, JP |
Family ID: | 1000005638710 |
Appl. No.: | 17/260069 |
Filed: | July 19, 2019 |
PCT Filed: | July 19, 2019 |
PCT NO: | PCT/JP2019/028454 |
371 Date: | January 13, 2021 |
Current U.S. Class: | 1/1 |
Current CPC Class: | G06T 7/593 20170101; G06T 2207/30252 20130101; G06T 7/73 20170101; G06T 2207/10012 20130101 |
International Class: | G06T 7/73 20060101 G06T007/73; G06T 7/593 20060101 G06T007/593 |
Foreign Application Data
Date | Code | Application Number
Aug 31, 2018 | JP | 2018-163671
Claims
1. A transport object specifying device of a work machine, the
transport object specifying device comprising: an image acquisition
unit that acquires a captured image showing a drop target of the
work machine in which a transport object is dropped; a drop target
specifying unit that specifies a three-dimensional position of at
least part of the drop target based on the captured image; a
three-dimensional data generation unit that generates depth data,
which is three-dimensional data representing a depth of the
captured image, based on the captured image; and a surface
specifying unit that specifies a three-dimensional position of a
surface of the transport object in the drop target by removing,
from the depth data, a part corresponding to the drop target based
on the three-dimensional position of the at least part of the drop
target.
2. The transport object specifying device according to claim 1,
further comprising: a feature point specifying unit that specifies
a position of a feature point of the drop target based on the
captured image, the drop target specifying unit specifying the
three-dimensional position of the at least part of the drop target
based on the position of the feature point.
3. The transport object specifying device according to claim 1,
wherein the drop target specifying unit specifies the
three-dimensional position of the at least part of the drop target
based on the captured image and a target model, which is a
three-dimensional model indicating a shape of the drop target.
4. The transport object specifying device according to claim 1,
wherein the surface specifying unit extracts, from the depth data,
a three-dimensional position in a prismatic area which is
surrounded by a wall portion of the drop target and extends in a
height direction of the wall portion, and removes the part
corresponding to the drop target in the extracted three-dimensional
position to specify the three-dimensional position of the surface
of the transport object.
5. The transport object specifying device according to claim 1,
further comprising: a distribution specifying unit that generates
distribution information indicating a distribution of an amount of
the transport object in the drop target based on the
three-dimensional position of the surface of the transport object
in the drop target and the three-dimensional position of the at
least part of the drop target.
6. The transport object specifying device according to claim 5,
further comprising: a distribution estimation unit that estimates a
distribution of an amount of the transport object in a shielded
part of the distribution information shielded by an obstacle.
7. The transport object specifying device according to claim 6,
wherein the distribution estimation unit inputs the distribution
information generated by the distribution specifying unit to a
complementary model to generate distribution information that
complements a value of the shielded part, and the complementary
model is a trained model, which, when distribution information with
some missing values is input, outputs distribution information that
complements the missing values.
8. The transport object specifying device according to claim 6,
wherein the distribution estimation unit generates distribution
information that complements a value of the shielded part based on
a rate of change or a pattern of change in a three-dimensional
position of the transport object near the shielded part.
9. The transport object specifying device according to claim 6,
wherein the distribution estimation unit estimates a distribution
of an amount of the transport object in the shielded part based on
a type of the transport object.
10. The transport object specifying device according to claim 1,
wherein the captured image is a stereo image including at least a
first image and a second image captured by a stereo camera.
11. A work machine including the transport object specifying device
according to claim 1, the work machine further comprising: work
equipment usable to transport the transport object; an imaging
device; and a display device that displays information about the
transport object in the drop target specified by the transport
object specifying device.
12. A transport object specifying method of a work machine,
comprising: acquiring a captured image showing a drop target of the
work machine in which a transport object is dropped; specifying a
three-dimensional position of at least part of the drop target
based on the captured image; generating depth data, which is
three-dimensional data representing a depth of the captured image,
based on the captured image; and removing, from the depth data, a
part corresponding to the drop target based on the
three-dimensional position of the at least part of the drop target
to specify a three-dimensional position of a surface of the
transport object in the drop target.
13. A method for producing a complementary model, which, when
distribution information with some missing values is input, outputs
distribution information that complements the missing values, the
method comprising: acquiring distribution information indicating a
distribution of an amount of a transport object in a drop target of
a work machine and incomplete distribution information in which
some values of the distribution information are missing, as a
dataset for learning; and training the complementary model using
the dataset for learning such that when the incomplete distribution
information is used as an input value, the distribution information
becomes an output value.
14. (canceled)
Description
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a U.S. National stage application of
International Application No. PCT/JP2019/028454, filed on Jul. 19,
2019. This U.S. National stage application claims priority under 35
U.S.C. § 119(a) to Japanese Patent Application No.
2018-163671, filed in Japan on Aug. 31, 2018, the entire contents
of which are hereby incorporated herein by reference.
BACKGROUND
Field of the Invention
[0002] The present invention relates to a transport object
specifying device of a work machine, a work machine, a transport
object specifying method of a work machine, a method for producing
a complementary model, and a dataset for learning.
Background Information
[0003] Japanese Unexamined Patent Application, First Publication
No. 2001-71809 discloses a technique of calculating a position of
the center of gravity of a transport object based on the output of
a weighting sensor provided on a transport vehicle and displaying a
loaded state of the transport object.
SUMMARY
[0004] In the method described in Japanese Unexamined Patent
Application, First Publication No. 2001-71809, the position of the
center of gravity of a drop target such as the transport vehicle
can be determined, but a three-dimensional position of the
transport object in the drop target cannot be specified.
[0005] An object of the present invention is to provide a transport
object specifying device of a work machine, a work machine, a
transport object specifying method of a work machine, a method for
producing a complementary model, and a dataset for learning capable
of specifying a three-dimensional position of a transport object in
a drop target.
[0006] According to one aspect of the present invention, a
transport object specifying device of a work machine includes an
image acquisition unit that acquires a captured image showing a
drop target of the work machine in which a transport object is
dropped, a drop target specifying unit that specifies a
three-dimensional position of at least part of the drop target
based on the captured image, a three-dimensional data generation
unit that generates depth data which is three-dimensional data
representing a depth of the captured image, based on the captured
image, and a surface specifying unit that specifies a
three-dimensional position of a surface of the transport object in
the drop target by removing, from the depth data, a part
corresponding to the drop target based on the three-dimensional
position of the at least part of the drop target.
[0007] According to at least one of the above aspects, the
transport object specifying device can specify the distribution of
the transport object in the drop target.
BRIEF DESCRIPTION OF DRAWINGS
[0008] FIG. 1 is a diagram showing a configuration of a loading
place according to one embodiment.
[0009] FIG. 2 is an external view of a hydraulic excavator
according to one embodiment.
[0010] FIG. 3 is a schematic block diagram showing a configuration
of a control device according to a first embodiment.
[0011] FIG. 4 is a diagram showing an example of a configuration of
a neural network.
[0012] FIG. 5 is an example of guidance information.
[0013] FIG. 6 is a flowchart showing a display method of the
guidance information by the control device according to the first
embodiment.
[0014] FIG. 7 is a flowchart showing a learning method of a feature
point specifying model according to the first embodiment.
[0015] FIG. 8 is a flowchart showing a learning method of a
complementary model according to the first embodiment.
[0016] FIG. 9 is a schematic block diagram showing a configuration
of a control device according to a second embodiment.
[0017] FIG. 10 is a flowchart showing a display method of guidance
information by the control device according to the second
embodiment.
[0018] FIG. 11A is a diagram showing a first example of a method
for calculating an amount of a transport object in a dump body.
[0019] FIG. 11B is a diagram showing a second example of the method
for calculating the amount of the transport object in the dump
body.
DETAILED DESCRIPTION OF EMBODIMENT(S)
First Embodiment
[0020] Hereinafter, embodiments will be described in detail with
reference to drawings.
[0021] FIG. 1 is a diagram showing a configuration of a loading
place according to one embodiment.
[0022] At a construction site, a hydraulic excavator 100 which is a
loading machine and a dump truck 200 which is a transport vehicle
are provided. The hydraulic excavator 100 scoops a transport object
L such as earth from the construction site and loads the transport
object in the dump truck 200. The dump truck 200 transports the
transport object L loaded by the hydraulic excavator 100 to a
predetermined earth removal place. The dump truck 200 includes a
dump body 210 which is a container for accommodating the transport
object L. The dump body 210 is an example of a drop target in which
the transport object L is dropped.
(Configuration of Hydraulic Excavator)
[0023] FIG. 2 is an external view of a hydraulic excavator
according to one embodiment.
[0024] The hydraulic excavator 100 includes work equipment 110 that
is hydraulically operated, a swing body 120 that supports the work
equipment 110, and a travel body 130 that supports the swing body
120.
[0025] The swing body 120 is provided with a cab 121 in which an
operator rides. The cab 121 is provided in a front portion of the
swing body 120 and is positioned on a left-side (+Y side) of the
work equipment 110.
<<Control System of Hydraulic Excavator>>
[0026] The hydraulic excavator 100 includes a stereo camera 122, an
operation device 123, a control device 124, and a display device
125.
[0027] The stereo camera 122 is provided in an upper portion of the
cab 121. The stereo camera 122 is installed in an upper (+Z
direction) and front (+X direction) portion of the cab 121. The
stereo camera 122 captures an image in front (+X direction) of the
cab 121 through a windshield on a front surface of the cab 121. The
stereo camera 122 includes at least one pair of cameras.
[0028] The operation device 123 is provided inside the cab 121. The
operation device 123 is operated by the operator to supply
hydraulic oil to an actuator of the work equipment 110.
[0029] The control device 124 acquires information from the stereo
camera 122 to generate guidance information indicating a
distribution of the transport object in the dump body 210 of the
dump truck 200. The control device 124 is an example of a transport
object specifying device.
[0030] The display device 125 displays the guidance information
generated by the control device 124.
[0031] The hydraulic excavator 100 according to another embodiment
may not necessarily include the stereo camera 122 and the display
device 125.
<<Configuration of Stereo Camera>>
[0032] In the first embodiment, the stereo camera 122 includes a
right-side camera 1221 and a left-side camera 1222. Examples of each camera include a camera using a charge coupled device (CCD) sensor or a complementary metal oxide semiconductor (CMOS) sensor.
[0033] The right-side camera 1221 and the left-side camera 1222 are
installed at an interval in a left-right direction (Y-axis
direction) such that optical axes of the cameras 1221 and 1222 are
substantially parallel to a floor surface of the cab 121. The
stereo camera 122 is an example of an imaging device. The control
device 124 can calculate a distance between the stereo camera 122
and a captured target by using an image captured by the right-side
camera 1221 and an image captured by the left-side camera 1222.
Hereinafter, the image captured by the right-side camera 1221 is
also referred to as a right-eye image. The image captured by the
left-side camera 1222 is also referred to as a left-eye image. A
combination of the images captured by respective cameras of the
stereo camera 122 is also referred to as a stereo image. In another
embodiment, the stereo camera 122 may be configured of three or
more cameras.
(Configuration of Control Device)
[0034] FIG. 3 is a schematic block diagram showing a configuration
of the control device according to the first embodiment.
[0035] The control device 124 includes a processor 91, a main
memory 92, a storage 93, and an interface 94.
[0036] The storage 93 stores a program for controlling the work
equipment 110. Examples of the storage 93 include a hard disk drive
(HDD) and a non-volatile memory. The storage 93 may be an internal
medium directly connected to a bus of the control device 124, or
may be an external medium connected to the control device 124
through the interface 94 or a communication line. The storage 93 is
an example of a storage unit.
[0037] The processor 91 reads the program from the storage 93,
expands the program in the main memory 92, and executes processing
according to the program. The processor 91 secures a storage area
in the main memory 92 according to the program. The interface 94 is
connected to the stereo camera 122, the display device 125, and
other peripheral devices, and transmits and receives signals. The
main memory 92 is an example of the storage unit.
[0038] With the execution of the program, the processor 91 includes
a data acquisition unit 1701, a feature point specifying unit 1702,
a three-dimensional data generation unit 1703, a dump body
specifying unit 1704, a surface specifying unit 1705, a
distribution specifying unit 1706, a distribution estimation unit
1707, a guidance information generation unit 1708, and a display
control unit 1709. The storage 93 stores a camera parameter CP, a
feature point specifying model M1, a complementary model M2, and a
dump body model VD. The camera parameter CP is information
indicating a position relationship between the swing body 120 and
the right-side camera 1221 and a position relationship between the
swing body 120 and the left-side camera 1222. The dump body model
VD is a three-dimensional model representing a shape of the dump
body 210. In another embodiment, three-dimensional data
representing a shape of the dump truck 200 may be used instead of
the dump body model VD. The dump body model VD is an example of a
target model.
[0039] The program may be for realizing part of functions to be
exerted by the control device 124. For example, the program may
exert a function by a combination with another program already
stored in the storage 93 or a combination with another program
installed in another device. In another embodiment, the control
device 124 may include a custom large scale integrated circuit
(LSI) such as a programmable logic device (PLD) in addition to or
instead of the above configuration. Examples of the PLD include a
programmable array logic (PAL), a generic array logic (GAL), a
complex programmable logic device (CPLD), and a field programmable
gate array (FPGA). In this case, some or all of the functions
realized by the processor may be realized by the integrated
circuit.
[0040] The data acquisition unit 1701 acquires the stereo image
from the stereo camera 122 through the interface 94. The data
acquisition unit 1701 is an example of an image acquisition unit.
In another embodiment, in a case where the hydraulic excavator 100
does not include the stereo camera 122, the data acquisition unit
1701 may acquire a stereo image from a stereo camera provided in
another work machine, a stereo camera installed at the construction
site, or the like.
[0041] The feature point specifying unit 1702 inputs the right-eye
image of the stereo image acquired by the data acquisition unit
1701 to the feature point specifying model M1 stored in the storage
93 to specify positions of a plurality of feature points of the
dump body 210 shown in the right-eye image. Examples of the feature
point of the dump body 210 include upper and lower ends of a front
panel of the dump body 210, an intersection of a guard frame of the
front panel and a side gate, and upper and lower ends of a fixed
post of a tailgate. That is, the feature point is an example of a
predetermined position of the drop target.
[0042] The feature point specifying model M1 includes a neural
network 140 shown in FIG. 4. FIG. 4 is a diagram showing an example
of a configuration of the neural network. The feature point
specifying model M1 is realized by, for example, a trained model of a deep neural network (DNN). The trained model is configured of a
combination of a training model and a trained parameter.
[0043] As shown in FIG. 4, the neural network 140 includes an input
layer 141, one or more intermediate layers 142 (hidden layers), and
an output layer 143. Each of the layers 141, 142, and 143 includes
one or more neurons. The number of neurons in the intermediate layer 142 can be set as appropriate. The number of neurons in the output layer 143 can be set as appropriate according to the number of feature points.
[0044] Neurons in the layers adjacent to each other are connected
to each other, and a weight (connection load) is set for each
connection. The number of connected neurons may be set as
appropriate. A threshold value is set for each neuron, and an
output value of each neuron is determined by whether or not a sum
of products of an input value and the weight for each neuron
exceeds the threshold value.
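The sum-of-products threshold behavior described above can be sketched as follows. This is an illustrative sketch only; the function name, array shapes, and values are hypothetical and do not appear in the embodiment.

```python
import numpy as np

def layer_forward(x, weights, thresholds):
    """One fully connected layer with threshold activation: each
    neuron outputs 1 when the sum of products of its input values
    and connection weights exceeds its threshold value, else 0."""
    s = weights @ x                      # sum of products per neuron
    return (s > thresholds).astype(float)

x = np.array([0.5, 1.0])                 # two input neurons
weights = np.array([[0.2, 0.8],          # 3 neurons x 2 inputs
                    [1.0, -0.5],
                    [0.3, 0.3]])
thresholds = np.array([0.5, 0.2, 1.0])   # one threshold per neuron
print(layer_forward(x, weights, thresholds))  # [1. 0. 0.]
```

Modern DNN layers replace the hard threshold with differentiable activations so the weights can be trained by gradient descent, but the connection-weight and per-neuron-threshold structure is as described in the paragraph above.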
[0045] An image showing the dump body 210 of the dump truck 200 is
input to the input layer 141. For each pixel of the image, an
output value indicating a probability of the pixel being the
feature point is output to the output layer 143. That is, the
feature point specifying model M1 is a trained model which is
trained, when an image showing the dump body 210 is input, to
output the positions of the feature points of the dump body 210 in
the image. The feature point specifying model M1 is trained by
using, for example, a dataset for learning with an image showing
the dump body 210 of the dump truck 200 as training data and with
an image obtained by plotting the positions of the feature points
of the dump body 210 as teaching data. The teaching data is an image in which each pixel related to the plot has a value indicating that the probability of the pixel being the feature point is 1, and every other pixel has a value indicating that the probability is 0. The teaching data may also be non-image information that assigns the same probabilities to the pixels. In the present embodiment, "training data" refers to
data input to the input layer during training of the training
model. In the present embodiment, "teaching data" is data which is
a correct answer for comparison with the value of the output layer
of the neural network 140. In the present embodiment, "dataset for
learning" refers to a combination of the training data and the
teaching data. The trained parameters of the feature point
specifying model M1 obtained by training are stored in the storage
93. The trained parameters include, for example, the number of
layers of the neural network 140, the number of neurons in each
layer, the connection relationship between the neurons, the weight
of each connection between the neurons, and the threshold value of
each neuron. For example, a DNN configuration that is the same as or similar to one used for detecting a facial organ or for estimating a posture of a person can be used as the configuration of the neural network 140 of the feature point specifying model M1. The feature point specifying model M1 is an
example of a position specifying model. The feature point
specifying model M1 according to another embodiment may be trained
by unsupervised learning or reinforcement learning.
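Building the teaching data described above (probability 1 at each plotted feature-point pixel, 0 elsewhere) can be sketched as follows; the function name and grid size are hypothetical, not from the embodiment.

```python
import numpy as np

def make_teaching_data(height, width, feature_points):
    """One per-pixel probability map per feature point: value 1 at
    the plotted pixel, 0 at every other pixel, as described for the
    teaching data of the feature point specifying model."""
    maps = np.zeros((len(feature_points), height, width))
    for i, (row, col) in enumerate(feature_points):
        maps[i, row, col] = 1.0
    return maps

# Two hypothetical feature points in a 4 x 4 image
t = make_teaching_data(4, 4, [(1, 2), (3, 0)])
print(t.shape)        # (2, 4, 4)
```

In practice such targets are often smoothed into Gaussian heatmaps, as in facial-landmark and pose-estimation DNNs; the hard 0/1 map above is the form stated in the paragraph.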
[0046] The three-dimensional data generation unit 1703 generates a
three-dimensional map representing a depth in an imaging range of
the stereo camera 122 by stereo measurement using the stereo image
and the camera parameters stored in the storage 93. Specifically,
the three-dimensional data generation unit 1703 generates point
group data indicating a three-dimensional position by the stereo
measurement of the stereo image. The point group data is an example
of depth data. In another embodiment, the three-dimensional data
generation unit 1703 may generate an elevation map generated from
the point group data as three-dimensional data instead of the point
group data.
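The stereo measurement that turns the stereo image into point group data can be sketched with standard triangulation (depth Z = focal length x baseline / disparity). The function name, camera parameters, and the toy disparity map below are illustrative assumptions, not values from the embodiment.

```python
import numpy as np

def stereo_to_points(disparity, focal_px, baseline_m, cx, cy):
    """Convert a disparity map from a rectified stereo pair into
    point group data (N x 3) by stereo triangulation. Pixels with
    zero disparity (no match) are discarded."""
    rows, cols = disparity.shape
    v, u = np.mgrid[0:rows, 0:cols]
    valid = disparity > 0
    z = focal_px * baseline_m / disparity[valid]   # depth
    x = (u[valid] - cx) * z / focal_px
    y = (v[valid] - cy) * z / focal_px
    return np.stack([x, y, z], axis=1)

d = np.array([[2.0, 4.0], [0.0, 1.0]])    # toy 2 x 2 disparity map
pts = stereo_to_points(d, focal_px=500.0, baseline_m=0.1, cx=0.5, cy=0.5)
print(pts.shape)      # one 3-D point per valid-disparity pixel
```

The camera parameters CP stored in the storage 93 would supply the real focal length, baseline, and principal point, and the result would then be transformed from the camera frame into the site or swing-body frame.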
[0047] The dump body specifying unit 1704 specifies a
three-dimensional position of the dump body 210 based on the
positions of the feature points specified by the feature point
specifying unit 1702, the point group data specified by the
three-dimensional data generation unit 1703, and the dump body
model VD. Specifically, the dump body specifying unit 1704
specifies three-dimensional positions of the feature points based
on the positions of the feature points specified by the feature
point specifying unit 1702 and the point group data specified by
the three-dimensional data generation unit 1703. Next, the dump
body specifying unit 1704 fits the dump body model VD to the
three-dimensional positions of the feature points to specify the
three-dimensional position of the dump body 210. In another
embodiment, the dump body specifying unit 1704 may specify the
three-dimensional position of the dump body 210 based on the
elevation map.
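The fitting step above (aligning the dump body model VD to the specified three-dimensional feature points) can be sketched as a least-squares rigid fit. The patent does not state the fitting algorithm; the Kabsch algorithm below is one common choice, and all names are hypothetical.

```python
import numpy as np

def fit_rigid(model_pts, observed_pts):
    """Least-squares rigid fit (Kabsch algorithm) of model feature
    points (rows of model_pts) to their observed 3-D positions.
    Returns rotation R and translation t with R @ m + t ~ o."""
    mc, oc = model_pts.mean(0), observed_pts.mean(0)
    H = (model_pts - mc).T @ (observed_pts - oc)   # cross-covariance
    U, _, Vt = np.linalg.svd(H)
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:                       # avoid reflections
        Vt[-1] *= -1
        R = Vt.T @ U.T
    t = oc - R @ mc
    return R, t

# Hypothetical model feature points and a known pose to recover
model = np.array([[0., 0., 0.], [1., 0., 0.], [0., 1., 0.], [0., 0., 1.]])
R_true = np.array([[0., -1., 0.], [1., 0., 0.], [0., 0., 1.]])  # 90 deg about Z
t_true = np.array([1., 2., 3.])
observed = model @ R_true.T + t_true
R, t = fit_rigid(model, observed)
```

With the fitted pose, every surface of the dump body model (bottom surface, front panel, side gate, tailgate) is placed at a known three-dimensional position, which is what the subsequent removal step relies on.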
[0048] The surface specifying unit 1705 specifies a
three-dimensional position of a surface of the transport object L
on the dump body 210 based on the point group data generated by the
three-dimensional data generation unit 1703 and the
three-dimensional position of the dump body 210 specified by the
dump body specifying unit 1704. Specifically, the surface
specifying unit 1705 cuts out a part above a bottom surface of the
dump body 210 from the point group data generated by the
three-dimensional data generation unit 1703 to specify the
three-dimensional position of the surface of the transport object L
on the dump body 210.
[0049] The distribution specifying unit 1706 generates a dump body
map indicating a distribution of an amount of the transport object
L on the dump body 210 based on the three-dimensional position of
the bottom surface of the dump body 210 specified by the dump body
specifying unit 1704 and the three-dimensional position of the
surface of the transport object L specified by the surface
specifying unit 1705. The dump body map is an example of
distribution information. The dump body map is, for example, an
elevation map of the transport object L with reference to the
bottom surface of the dump body 210.
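Generating such a dump body map from the surface points can be sketched as binning the point group into a grid and keeping the height above the bottom surface per cell. The function name, cell size, and points are illustrative assumptions.

```python
import numpy as np

def dump_body_map(points, x_max, y_max, cell=0.5):
    """Elevation map of transport-object height above the dump body
    bottom surface: points are (x, y, z) in the dump body coordinate
    system with z measured from the bottom surface. Cells containing
    no points keep NaN (no height data)."""
    nx, ny = int(np.ceil(x_max / cell)), int(np.ceil(y_max / cell))
    grid = np.full((nx, ny), np.nan)
    for x, y, z in points:
        i, j = int(x // cell), int(y // cell)
        if 0 <= i < nx and 0 <= j < ny:
            # keep the highest surface point that falls in the cell
            grid[i, j] = z if np.isnan(grid[i, j]) else max(grid[i, j], z)
    return grid

pts = [(0.1, 0.1, 1.2), (0.2, 0.3, 0.8), (1.4, 1.4, 2.0)]
m = dump_body_map(pts, x_max=2.0, y_max=2.0, cell=1.0)
print(m)
```

The NaN cells correspond to the grids "without the height data" that the distribution estimation unit 1707 later complements.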
[0050] The distribution estimation unit 1707 generates a dump body
map in which a value is complemented for a part of the dump body
map that does not have a value of height data. That is, the
distribution estimation unit 1707 estimates a three-dimensional
position of a shielded part of the dump body map that is shielded
by an obstacle to update the dump body map. Examples of the
obstacle include the work equipment 110, the tailgate of the dump
body 210, and the transport object L.
[0051] Specifically, the distribution estimation unit 1707 inputs
the dump body map into the complementary model M2 stored in the
storage 93 to generate a dump body map in which the height data is
complemented. The complementary model M2 is realized by, for
example, a trained model of DNN including the neural network 140
shown in FIG. 4. The complementary model M2 is a trained model
which is trained, when a dump body map including a grid without the
height data is input, to output a dump body map in which all grids
have the height data. For example, the complementary model M2 is
trained with a combination of a complete dump body map in which all
grids have the height data, which is generated by simulation or the
like, and an incomplete dump body map in which part of the height
data is removed from the complete dump body map, as a dataset for
learning. The complementary model M2 according to another
embodiment may be trained by unsupervised learning or reinforcement
learning.
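As a minimal stand-in for the trained complementary model M2 (realized in the embodiment as a DNN), missing grid values can be filled from neighboring cells that do have height data, in the spirit of the neighbor-based complement of claim 8. The function name and the averaging rule are illustrative assumptions.

```python
import numpy as np

def complement_map(grid):
    """Fill grids without height data (NaN) with the mean of the
    4-neighbors that do have data - a simple substitute for the
    trained complementary model M2."""
    filled = grid.copy()
    for i, j in zip(*np.where(np.isnan(grid))):
        neigh = []
        for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            ni, nj = i + di, j + dj
            if (0 <= ni < grid.shape[0] and 0 <= nj < grid.shape[1]
                    and not np.isnan(grid[ni, nj])):
                neigh.append(grid[ni, nj])
        if neigh:
            filled[i, j] = np.mean(neigh)
    return filled

g = np.array([[1.0, np.nan], [1.0, 3.0]])   # one shielded grid cell
print(complement_map(g))
```

The trained model M2 can do better than local averaging because it learns typical heap shapes for the transport object from complete/incomplete map pairs in the dataset for learning.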
[0052] The guidance information generation unit 1708 generates the
guidance information from the dump body map generated by the
distribution estimation unit 1707.
[0053] FIG. 5 is an example of the guidance information. As shown
in FIG. 5, for example, the guidance information generation unit
1708 generates the guidance information for displaying a
two-dimensional heat map indicating a distribution of the height
from the bottom surface of the dump body 210 to the surface of the
transport object L. The granularity of the vertical and horizontal divisions in the heat map shown in FIG. 5 is an example and is not limited thereto in another embodiment. The heat map according to
another embodiment may represent, for example, a ratio of a height
of the transport object L to a height related to an upper limit of
the loading of the dump body 210.
[0054] The display control unit 1709 outputs a display signal for
displaying the guidance information to the display device 125.
[0055] The learning unit 1801 performs learning processing of the
feature point specifying model M1 and the complementary model M2.
The learning unit 1801 may be provided in a device separate from
the control device 124. In this case, the trained model which has
been trained in the separate device will be recorded in the storage
93.
<<Display Method>>
[0056] FIG. 6 is a flowchart showing a display method of the
guidance information by the control device according to the first
embodiment.
[0057] First, the data acquisition unit 1701 acquires the stereo
image from the stereo camera 122 (step S1). Next, the feature point
specifying unit 1702 inputs the right-eye image of the stereo image
acquired by the data acquisition unit 1701 to the feature point
specifying model M1 stored in the storage 93 to specify the
positions of the plurality of feature points of the dump body 210
shown in the right-eye image (step S2). Examples of the feature
point of the dump body 210 include the upper and lower ends of the
front panel of the dump body 210, the intersection of the guard
frame of the front panel and the side gate, and the upper and lower
ends of the fixed post of the tailgate. In another embodiment, the
feature point specifying unit 1702 may input the left-eye image to
the feature point specifying model M1 to specify the positions of
the plurality of feature points.
[0058] The three-dimensional data generation unit 1703 generates
the point group data of the entire imaging range of the stereo
camera 122 by the stereo measurement using the stereo image
acquired in step S1 and the camera parameters stored in the storage
93 (step S3).
[0059] The dump body specifying unit 1704 specifies the
three-dimensional positions of the feature points based on the
positions of the feature points specified in step S2 and the point
group data generated in step S3 (step S4). For example, the dump
body specifying unit 1704 specifies, using the point group data, a
three-dimensional point corresponding to the pixel showing the
feature point in the right-eye image to specify the
three-dimensional position of the feature point. The dump body
specifying unit 1704 fits the dump body model VD stored in the
storage 93 to the specified positions of the feature points to
specify the three-dimensional position of the dump body 210 (step
S5). At this time, the dump body specifying unit 1704 may convert a
coordinate system of the point group data into a dump body
coordinate system having a corner of the dump body 210 as the
origin, based on the three-dimensional position of the dump body
210. The dump body coordinate system can be represented as, for
example, a coordinate system composed of an X-axis extending in a
width direction of the front panel, a Y-axis extending in a width
direction of the side gate, and a Z-axis extending in a height
direction of the front panel, with a lower left end of the front
panel as the origin. The dump body specifying unit 1704 is an
example of a drop target specifying unit.
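For illustration only, the coordinate conversion described above can be sketched as follows. The function name, the argument layout, and the assumption that the origin and three orthonormal axis vectors are already available from the fitted dump body model VD are hypothetical and not part of the embodiment.

```python
def to_dump_body_coords(point, origin, x_axis, y_axis, z_axis):
    """Express a 3D point in the dump body coordinate system by
    projecting (point - origin) onto the orthonormal dump body axes."""
    d = [point[i] - origin[i] for i in range(3)]
    dot = lambda a, b: sum(a[i] * b[i] for i in range(3))
    return (dot(d, x_axis), dot(d, y_axis), dot(d, z_axis))

# Identity-pose example: dump body axes aligned with the world axes,
# origin at the lower left end of the front panel.
p = to_dump_body_coords((2.0, 3.0, 1.5), (1.0, 1.0, 0.0),
                        (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0))
```

With a real fitted pose, the axis vectors would be rotated relative to the world frame, but the projection is the same.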
[0060] The surface specifying unit 1705 extracts, from the point
group data generated in step S3, a plurality of three-dimensional
points in a prismatic area, which is surrounded by the front panel,
the side gate, and the tailgate of the dump body 210 specified in
step S5 and extends in the height direction of the front panel, to
remove three-dimensional points corresponding to the background
from the point group data (step S6). The front panel, the side
gate, and the tailgate form a wall portion of the dump body 210. In
a case where the point group data has been converted into the dump
body coordinate system in step S5, the surface specifying unit 1705
sets threshold values on the X-axis, the Y-axis, and the Z-axis,
determined based on a known size of the dump body 210, and extracts
the three-dimensional points in the area defined by those threshold
values. The height of the prismatic area may be equal to the height
of the front panel or may be higher than the height of the front
panel by a predetermined length. Making the height of the prismatic
area greater than that of the front panel allows the transport
object L to be accurately extracted even in a case where the
transport object L is loaded higher than the height of the dump
body 210. The prismatic area may be an area narrowed inward by a
predetermined distance from the area surrounded by the front panel,
the side gate, and the tailgate. In this case, even if the dump
body model VD is a simple 3D model in which the thicknesses of the
front panel, the side gate, the tailgate, and the bottom surface
are not accurate, an error in the point group data can be reduced.
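The threshold test above can be sketched as a simple filter. This is an illustrative sketch in dump body coordinates; the function and parameter names are hypothetical, with `margin` narrowing the area inward and `extra_height` raising the prism above the front panel as described in the text.

```python
def extract_prism_points(points, width, length, height,
                         margin=0.0, extra_height=0.0):
    """Keep only the three-dimensional points inside the prismatic area.

    width, length, and height correspond to the known dump body size on
    the X-, Y-, and Z-axes of the dump body coordinate system."""
    kept = []
    for x, y, z in points:
        if (margin <= x <= width - margin and
                margin <= y <= length - margin and
                0.0 <= z <= height + extra_height):
            kept.append((x, y, z))
    return kept
```

Points outside the prism (background, or spill beyond the gates) are discarded in a single pass over the point group data.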
[0061] The surface specifying unit 1705 removes three-dimensional
points corresponding to the position of the dump body model VD from
the plurality of three-dimensional points extracted in step S6 to
specify the three-dimensional position of the surface of the
transport object L loaded on the dump body 210 (step S7). The
distribution specifying unit 1706 generates the dump body map which
is an elevation map representing the height in the height direction
of the front panel with the bottom surface of the dump body 210 as
a reference height, based on the plurality of three-dimensional
points extracted in step S6 and the bottom surface of the dump body
210 (step S8). The dump body map may include a grid without the
height data. In a case where the point group data is converted into
the dump body coordinate system in step S5, the distribution
specifying unit 1706 can generate the dump body map by obtaining an
elevation map with an XY plane as the reference height and with the
Z-axis direction as the height direction.
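An elevation map of the kind described here can be sketched as a grid binning of the surface points. The function name, the cell-size parameter, and the use of `None` for grids without height data are illustrative assumptions.

```python
def build_elevation_map(points, width, length, cell):
    """Bin surface points on the XY plane into an elevation map, keeping
    the maximum Z per grid cell; cells with no data remain None."""
    nx, ny = int(width / cell), int(length / cell)
    grid = [[None] * ny for _ in range(nx)]
    for x, y, z in points:
        i, j = int(x / cell), int(y / cell)
        if 0 <= i < nx and 0 <= j < ny:
            if grid[i][j] is None or z > grid[i][j]:
                grid[i][j] = z
    return grid

dump_body_map = build_elevation_map(
    [(0.5, 0.5, 1.0), (0.5, 0.5, 2.0), (1.5, 0.5, 0.7)], 2.0, 1.0, 1.0)
```

The `None` cells correspond to the shielded parts that the complementary model M2 later fills in.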
[0062] The distribution estimation unit 1707 inputs the dump body
map generated in step S8 into the complementary model M2 stored in
the storage 93 to generate the dump body map in which the height
data is complemented (step S9). The guidance information generation
unit 1708 generates the guidance information shown in FIG. 5 based
on the complemented dump body map (step S10). The display control
unit 1709 outputs the display signal for displaying the guidance
information to the display device 125 (step S11).
[0063] Depending on the embodiment, the processing of steps S2 to
S4 and steps S7 to S11 among the processing by the control device
124 shown in FIG. 6 may not be executed.
[0064] Instead of the processing of steps S3 and S4 among the
processing by the control device 124 shown in FIG. 6, the positions
of the feature points in the left-eye image may be specified from
the positions of the feature points in the right-eye image by the
stereo matching to specify the three-dimensional positions of the
feature points using triangulation. Instead of the processing of
step S6, point group data may be generated only in the prismatic
area which is surrounded by the front panel, the side gate, and the
tailgate of the dump body 210 specified in step S5 and which
extends in the height direction of the front panel. In this case,
since it is not
necessary to generate the point group data of the entire imaging
range, the calculation load can be reduced.
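The triangulation mentioned here can be illustrated with the standard disparity relation for a rectified stereo pair. This is a textbook pinhole-model sketch, not the embodiment's actual implementation; the function name and parameters (focal length `f`, baseline, principal point `cx`, `cy`) are assumptions.

```python
def triangulate(u_left, u_right, v, f, baseline, cx, cy):
    """Recover a 3D point from a rectified stereo correspondence using
    the disparity relation Z = f * B / d (pinhole camera model)."""
    disparity = u_left - u_right
    z = f * baseline / disparity
    x = (u_left - cx) * z / f
    y = (v - cy) * z / f
    return (x, y, z)
```

Given a feature point matched at the same image row in both images, the horizontal pixel offset (disparity) alone determines its depth.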
(Learning Method)
[0065] FIG. 7 is a flowchart showing a learning method of the
feature point specifying model M1 according to the first
embodiment. The data acquisition unit 1701 acquires the training
data (step S101). For example, the training data in the feature
point specifying model M1 is an image showing the dump body 210.
The training data may be acquired from an image captured by the
stereo camera 122. The training data may be acquired from an image
captured by another work machine. An image showing a work machine
different from the dump truck, for example, an image showing a dump
body of a wheel loader may be used as the training data. It is
possible to improve robustness of dump body recognition by using
dump bodies of various types of work machines as the training
data.
[0066] Next, the learning unit 1801 performs training of the
feature point specifying model M1. The learning unit 1801 performs
training of the feature point specifying model M1 using the
combination of the training data acquired in step S101 and the
teaching data which is the image obtained by plotting the positions
of the feature points of the dump body, as the dataset for learning
(step S102). For example, the learning unit 1801 uses the training
data as an input to perform calculation processing of the neural
network 140 in a forward propagation direction. Accordingly, the
learning unit 1801 obtains an output value output from the output
layer 143 of the neural network 140. The dataset for learning may
be stored in the main memory 92 or the storage 93. Next, the
learning unit 1801 calculates an error between the output value
from the output layer 143 and the teaching data. The output value
from the output layer 143 is a value representing the probability
of a pixel being the feature point, and the teaching data is the
information obtained by plotting the position of the feature point.
The learning unit 1801 calculates an error of the weight of each
connection between the neurons and an error of the threshold value
of each neuron by backpropagation from the calculated error of the
output value. The learning unit 1801 updates the weight of each
connection between the neurons and the threshold value of each
neuron based on the calculated errors.
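The loop described in this paragraph (forward propagation, error calculation, backpropagation of gradients, parameter update, repeat until the output matches the teaching data) can be illustrated on a deliberately minimal model. A single linear neuron stands in for the neural network 140; the function name, learning rate, and tolerance are hypothetical.

```python
def train_until_match(samples, lr=0.1, tol=1e-4, max_iter=10000):
    """Repeat forward pass, error, gradient, and update until the output
    value matches the teaching data within a tolerance (cf. steps S102-S103)."""
    w, b = 0.0, 0.0
    for _ in range(max_iter):
        err = gw = gb = 0.0
        for x, t in samples:
            y = w * x + b            # forward propagation
            e = y - t                # error against the teaching data
            err += e * e
            gw += 2.0 * e * x        # gradient of the squared error w.r.t. w
            gb += 2.0 * e            # gradient w.r.t. b
        if err < tol:                # output "matches" the teaching data
            break
        w -= lr * gw / len(samples)  # update from the backpropagated error
        b -= lr * gb / len(samples)
    return w, b
```

For a real network, a framework would compute the per-layer gradients, but the stopping criterion and update cycle are the same in shape.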
[0067] The learning unit 1801 determines whether or not the output
value from the feature point specifying model M1 matches the
teaching data (step S103). It may be determined that the output
value matches the teaching data when an error between the output
value and the teaching data is within a predetermined value. In a
case where the output value from the feature point specifying model
M1 does not match the teaching data (step S103: NO), the above
processing is repeated until the output value from the feature
point specifying model M1 matches the teaching data. As a result,
the parameters of the feature point specifying model M1 are
optimized, and the feature point specifying model M1 can be
trained.
[0068] In a case where the output value from the feature point
specifying model M1 matches the teaching data (step S103: YES), the
learning unit 1801 records the feature
point specifying model M1 as a trained model including the
parameters optimized by the training in the storage 93 (step
S104).
[0069] FIG. 8 is a flowchart showing a learning method of the
complementary model according to the first embodiment. The data
acquisition unit 1701 acquires the complete dump body map in which
all grids have the height data as teaching data (step S111). The
complete dump body map is generated, for example, by simulation or
the like. The learning unit 1801 randomly removes a part of the
height data of the complete dump body map to generate the
incomplete dump body map as training data (step S112).
[0070] Next, the learning unit 1801 performs training of the
complementary model M2. The learning unit 1801 performs training of
the complementary model M2 with the combination of the training
data generated in step S112 and the teaching data acquired in step
S111 as the dataset for learning (step S113). For example, the
learning unit 1801 uses the training data as an input to perform
calculation processing of the neural network 140 in a forward
propagation direction. Accordingly, the learning unit 1801 obtains
an output value output from the output layer 143 of the neural
network 140. The dataset for learning may be stored in the main
memory 92 or the storage 93. Next, the learning unit 1801
calculates an error between the dump body map output from the
output layer 143 and the complete dump body map as the teaching
data. The learning unit 1801 calculates an error of the weight of
each connection between the neurons and an error of threshold value
of each neuron by backpropagation from the calculated error of the
output value. The learning unit 1801 updates the weight of each
connection between the neurons and the threshold value of each
neuron based on the calculated errors.
[0071] The learning unit 1801 determines whether or not the output
value from the complementary model M2 matches the teaching data
(step S114). It may be determined that the output value matches the
teaching data when an error between the output value and the
teaching data is within a predetermined value. In a case where the
output value from the complementary model M2 does not match the
teaching data (step S114: NO), the above processing is repeated
until the output value from the complementary model M2 matches the
complete dump body map. As a result, the parameters of the
complementary model M2 are optimized, and the complementary model
M2 can be trained.
[0072] In a case where the output value from the complementary
model M2 matches the teaching data (step S114: YES), the learning
unit 1801 records the complementary model M2 as a trained model
including the parameters optimized by the training in the storage
93 (step S115).
(Operation and Effects)
[0073] As described above, according to the first embodiment, the
control device 124 specifies the three-dimensional positions of the
surface of the transport object L and the bottom surface of the
dump body 210 based on the captured image, and generates the dump
body map indicating the distribution of the amount of the transport
object L on the dump body 210 based on the three-dimensional
positions. Accordingly, the control device 124 can specify the
distribution of the transport object L on the dump body 210. The
operator can recognize the drop position of the transport object L
for loading the transport object L on the dump body 210 in a
well-balanced manner by recognizing the distribution of the
transport object L on the dump body 210.
[0074] The control device 124 according to the first embodiment
estimates the distribution of the amount of the transport object L
in the shielded part of the dump body map shielded by an obstacle.
Accordingly, the operator can recognize the distribution of the
amount of the transport object L even for a part of the dump body
210 that is shielded by the obstacle and cannot be captured by the
stereo camera 122.
Second Embodiment
[0075] The control device 124 according to a second embodiment
specifies the distribution of the transport object L on the dump
body 210 based on a type of the transport object L.
[0076] FIG. 9 is a schematic block diagram showing a configuration
of a control device according to the second embodiment.
[0077] The control device 124 according to the second embodiment
further includes a type specifying unit 1710. The storage 93 stores
a type specifying model M3 and a plurality of complementary models
M2 according to the type of the transport object L.
[0078] The type specifying unit 1710 inputs an image of the
transport object L to the type specifying model M3 to specify the
type of the transport object L shown in the image. Examples of the
type of transport object include clay, sand, gravel, rock, and
wood.
[0079] The type specifying model M3 is realized by, for example, a
trained model of deep neural network (DNN). The type specifying
model M3 is a trained model which is trained, when an image showing
the transport object L is input, to output the type of the
transport object L. As a DNN configuration of the type specifying
model M3, for example, a DNN configuration that is the same as or
similar to one used for image recognition can be used. The type
specifying model M3 is trained, for example, using a combination of
an image showing the transport object L and a label representing
the type of the transport object L as teaching data. The type specifying
model M3 may be trained by transfer learning of a general trained
image recognition model. The type specifying model M3 according to
another embodiment may be trained by unsupervised learning or
reinforcement learning.
[0080] The storage 93 stores the complementary model M2 for each
type of the transport object L. For example, the storage 93 stores
a complementary model M2 for clay, a complementary model M2 for
sand, a complementary model M2 for gravel, a complementary model M2
for rock, and a complementary model M2 for wood. Each complementary
model M2 is trained, for example, using a combination of a complete
dump body map generated by simulation or the like according to the
type of the transport object L and an incomplete dump body map
obtained by removing part of the height data from the dump body map
as teaching data.
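The per-type selection described here can be sketched as a lookup table of models. All names below are illustrative; the identity lambdas are placeholders standing in for the trained complementary models M2, not real model code.

```python
# Hypothetical registry mapping transport object type to its
# complementary model M2 (placeholders shown as identity functions).
complementary_models = {
    t: (lambda dump_body_map: dump_body_map)
    for t in ("clay", "sand", "gravel", "rock", "wood")
}

def complement_by_type(dump_body_map, transport_type):
    """Look up the complementary model M2 for the specified type of the
    transport object L and apply it to the dump body map."""
    return complementary_models[transport_type](dump_body_map)
```

In step S31 of the second embodiment, the key would come from the output of the type specifying model M3.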
(Display Method)
[0081] FIG. 10 is a flowchart showing a display method of the
guidance information by the control device according to the second
embodiment.
[0082] First, the data acquisition unit 1701 acquires the stereo
image from the stereo camera 122 (step S21). Next, the feature
point specifying unit 1702 inputs the right-eye image of the stereo
image acquired by the data acquisition unit 1701 to the feature
point specifying model M1 stored in the storage 93 to specify the
positions of the plurality of feature points of the dump body 210
shown in the right-eye image (step S22).
[0083] The three-dimensional data generation unit 1703 generates
the point group data of the entire imaging range of the stereo
camera 122 by the stereo measurement using the stereo image
acquired in step S21 and the camera parameters stored in the
storage 93 (step S23).
[0084] The dump body specifying unit 1704 specifies the
three-dimensional positions of the feature points based on the
positions of the feature points specified in step S22 and the point
group data generated in step S23 (step S24). The dump body
specifying unit 1704 fits the dump body model VD stored in the
storage 93 to the specified positions of the feature points to
specify the three-dimensional position of the bottom surface of the
dump body 210 (step S25). For example, the dump body specifying
unit 1704 disposes, in a virtual space, the dump body model VD
created based on the dimensions of the dump truck 200 to be
detected, based on at least three of the specified positions of the
feature points.
[0085] The surface specifying unit 1705 extracts, from the point
group data generated in step S23, a plurality of three-dimensional
points in a prismatic area, which is surrounded by the front panel,
the side gate, and the tailgate of the dump body 210 specified in
step S25 and extends in the height direction of the front panel, to
remove three-dimensional points corresponding to the background
from the point group data (step S26). The surface specifying unit
1705 removes three-dimensional points corresponding to the position
of the dump body model VD from the plurality of three-dimensional
points extracted in step S26 to specify the three-dimensional
position of the surface of the transport object L loaded on the
dump body 210 (step S27). The distribution specifying unit 1706
generates the dump body map which is an elevation map with the
bottom surface of the dump body 210 as a reference height, based on
the plurality of three-dimensional points extracted in step S27 and
the bottom surface of the dump body 210 (step S28). The dump body
map may include a grid without the height data.
[0086] The surface specifying unit 1705 specifies an area where the
transport object L appears in the right-eye image based on the
three-dimensional position of the surface of the transport object L
specified in step S27 (step S29). For example, the surface
specifying unit 1705 specifies a plurality of pixels in the
right-eye image corresponding to the plurality of three-dimensional
points extracted in step S27 and determines an area composed of the
plurality of specified pixels as the area where the transport
object L appears. The type specifying unit 1710 extracts the area
where the transport object L appears from the right-eye image and
inputs an image related to the area to the type specifying model M3
to specify the type of the transport object L (step S30).
[0087] The distribution estimation unit 1707 inputs the dump body
map generated in step S28 to the complementary model M2 associated
with the type specified in step S30 to generate the dump body map
in which the height data is complemented (step S31). The guidance
information generation unit 1708 generates the guidance information
based on the dump body map (step S32). The display control unit
1709 outputs the display signal for displaying the guidance
information to the display device 125 (step S33).
(Operation and Effects)
[0088] As described above, according to the second embodiment, the
control device 124 estimates the distribution of the amount of the
transport object L in the shielded part based on the type of the
transport object L. That is, characteristics (for example, the
angle of repose) of the transport object L loaded on the dump body
210 differ depending on the type of the transport object L.
Therefore, according to the second embodiment, it is possible to
more accurately estimate the distribution of the transport object L
in the shielded part according to the type of the transport object
L.
Another Embodiment
[0089] Although embodiments have been described in detail with
reference to the drawings, a specific configuration is not limited
to the above, and various design changes and the like can be
made.
[0090] For example, although the control device 124 according to
the above-described embodiment is mounted on the hydraulic
excavator 100, the present invention is not limited thereto. For
example, the control device 124 according to another embodiment may
be provided in a remote server device. The control device 124 may
be realized by a plurality of computers. In this case, part of the
configuration of the control device 124 may be provided in the
remote server device. That is, the control device 124 may be
implemented as a transport object specifying system composed of a
plurality of devices.
[0091] Although the drop target according to the above-described
embodiment is the dump body 210 of the dump truck 200, the present
invention is not limited thereto. For example, the drop target
according to another embodiment may be another drop target such as
a hopper.
[0092] Although the captured image according to the above-described
embodiment is the stereo image, the present invention is not
limited thereto. For example, in another embodiment, the
calculation may be performed based on one image instead of the
stereo image. In this case, the control device 124 can specify the
three-dimensional position of the transport object L by using, for
example, a trained model that generates depth information from the
one image.
[0093] Although the control device 124 according to the
above-described embodiment complements the value of the shielded
part of the dump body map by using the complementary model M2, the
present invention is not limited thereto. For example, the control
device 124 according to another embodiment may estimate a height of
the shielded part based on a rate of change or a pattern of change
in the height of the transport object L near the shielded part. For
example, in a case where the height of the transport object L near
the shielded part decreases as it approaches the shielded part, the
control device 124 may estimate the height of the transport object
L in the shielded part to be lower than the height near the
shielded part, based on the rate of change in the height.
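This rate-of-change estimate can be sketched as a one-dimensional linear extrapolation. The function name, the two-cell slope estimate, and the clamp at zero height are illustrative assumptions, not the embodiment's defined method.

```python
def estimate_shielded_height(visible_heights, steps_into_shield=1):
    """Extrapolate the height inside the shielded part from the rate of
    change of the last two visible cells approaching it, clamped at 0."""
    rate = visible_heights[-1] - visible_heights[-2]
    estimate = visible_heights[-1] + rate * steps_into_shield
    return max(estimate, 0.0)    # height cannot go below the bottom surface
```

A decreasing run of visible heights thus yields an estimated shielded height lower than the nearest visible cell, as described above.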
[0094] The control device 124 according to another embodiment may
estimate the height of the transport object L in the shielded part
by simulation in consideration of a physical property such as the
angle of repose of the transport object L. The control device 124
according to another embodiment may deterministically estimate the
height of the transport object L in the shielded part based on a
cellular automaton in which each grid of the dump body map is
regarded as a cell.
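A cellular-automaton treatment of this kind can be illustrated with a deliberately simplified one-dimensional toppling rule, where `max_diff` stands in for the slope allowed by the angle of repose. The rule below only flows material to the right-hand neighbor; a real dump body map would relax in both grid directions.

```python
def relax(heights, max_diff, iterations=100):
    """One-dimensional toppling rule: when a cell exceeds its right-hand
    neighbor by more than max_diff, half of the excess flows to the
    neighbor; repeat until no cell moves or the iteration limit is hit."""
    g = list(heights)
    for _ in range(iterations):
        moved = False
        for i in range(len(g) - 1):
            diff = g[i] - g[i + 1]
            if diff > max_diff:
                flow = (diff - max_diff) / 2.0
                g[i] -= flow
                g[i + 1] += flow
                moved = True
        if not moved:
            break
    return g
```

The rule conserves the total amount of material while flattening any slope steeper than `max_diff`, which is the qualitative behavior a repose-angle simulation would enforce in the shielded cells.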
[0095] The control device 124 according to another embodiment may
display information related to the dump body map, including a part
where the height data is missing, without complementing the dump
body map.
[0096] FIG. 11A is a diagram showing a first example of a method
for calculating an amount of a transport object in a dump body.
FIG. 11B is a diagram showing a second example of the method for
calculating the amount of the transport object in the dump
body.
[0097] As shown in FIG. 11A, the dump body map according to the
above-described embodiment is represented by a height from a bottom
surface L1 of the dump body 210 to an upper limit of the loading on
the dump body 210, but the present invention is not limited
thereto.
[0098] For example, as shown in FIG. 11B, the dump body map
according to another embodiment may represent a height from a
reference plane L3, defined with respect to the bottom surface, to
the surface L2 of the transport object L. In the example shown in
FIG. 11B, the
reference plane L3 is a plane parallel to the ground surface and
passing through a point of the bottom surface closest to the ground
surface. In this case, the operator can easily recognize the amount
of the transport object L that can still be loaded before the dump
body 210 is full, regardless of an inclination of the dump body
210.
[0099] Although the control device 124 according to the
above-described embodiment generates the dump body map based on the
bottom surface of the dump body 210 and the surface of the
transport object L, the present invention is not limited thereto.
For example, the control device 124 according to another embodiment
may calculate the dump body map based on an opening surface of the
dump body 210, the surface of the transport object, and a height
from the bottom surface to the opening surface of the dump body
210. That is, the control device 124 may calculate the dump body
map by subtracting, from the height from the bottom surface to the
opening surface of the dump body 210, a distance from an upper end
surface of the dump body to the surface of the transport object L.
The dump body map according to another embodiment may be based on
the opening surface of the dump body 210.
[0100] Although the guidance information generation unit 1708
according to the above-described embodiment extracts the feature
points from the right-eye image using the feature point specifying
model M1, the present invention is not limited thereto. For
example, in another embodiment, the guidance information generation
unit 1708 may extract the feature points from the left-eye image
using the feature point specifying model M1.
[0101] The transport object specifying device according to the
present invention can specify the distribution of the transport
object in the drop target.
* * * * *