In the past decades, the speed development of the Web and a large amount of data published through the Web have made it the largest public data source in the world. The network has become a carrier of massive information. How to efficiently classify text for the acquired massive information is a hot issue of current research. The traditional machine learning algorithms for text classification have many disadvantages such as inconspicuous text features, long training period and loss of word order. This article puts forward a BERT model based method for technology information text auto-Categoriz to improve the accuracy text classification of science and technology information. The results suggest that the using method has significantly improved accuracy, recall and fl_score, and has a good Chinese text classification effect.
Discussion(0)
No comments yet. Be the first to comment.