of using the EFP, of using ACT;
The STFT;
added Table. I in the revised manuscript
convolutional layers
the number of overlapped samples
To achieve our goal of separation of the anisotropic Gaussian window
revised Fig. 10
the RBF kernal
in each image
achieves higher classification accuracy than
of a specific stage
MobileNet extracts features from the "conv2d 11" layer whose size of output feature maps
but the number of rows is defined as follows.
when you use "An", it means that it is not necessarily architecture that you are using.
The architecture of TFFNet.
use the same time range [0,0.75] s
The observed signal has a length of 150000 samples within a time interval of 0.75 seconds
traditional FPN for construction of a multi-resolution
classier' mAP for the UWA communication signals dataset
we revised the manuscript as follows
We have revised the text as follows.
We have revised the following text.
This has been corrected.
The classication performance of the TFFNet is compared with that of two machine learning methods, random forest (RF)
image of a size 299 *299 is
is lower than that of RF and SVM-RBF
The RBF classification is less accurate compared to SVM-RBF.
In this work/ Using this way
extract the data
TFFNet with STFT results in a lower mAP
references [21][22][23] in the revised manuscript
traditional FPN for construction of a multi-resolution
section II
allows adjustment of the trade-off between complexity and energy concentration
Beluga whale and sperm whale sounds
a list of, values with
Parameters and each is a list of values [0.2, 0.5, 0.8, 1.1, 1.4, 1.7, 2.0,2.3].
The factor does not contain
For the high efficiency of sparse ACT
should be large values
second one with impulses (spermwhales's clicks)
We could not do this for FSST
experiment cannot be executed
a key factor influencing the classification
we change the coordinate system as in [31]
are learnt from UWA signals.
The work [12] has ...
The parameter is selected ( is a parameter, not an operator)
total number of samples
the number of samples within each class
There are several ways in
The drawback of these methods is their low capacity so that the addition of more training data
a parameter one can tune.
MobileNet, VGG-16 and Inception V3 backbone networks extract features
There are 680 signals with each type of modulation, of which 500, 110, and 70 are used for training, validation and testing, respectively.
Performance comparison for di erent classification tasks.
three trained classiers, namely RF, SVM-RBF and TFFNet