戒酒的李白
Committed by GitHub

Update README.md

Showing 1 changed file with 25 additions and 128 deletions
@@ -10,146 +10,43 @@
10 [![GitHub Contributors](https://img.shields.io/github/contributors/666ghj/Weibo_PublicOpinion_AnalysisSystem?style=flat-square)](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/graphs/contributors) 10 [![GitHub Contributors](https://img.shields.io/github/contributors/666ghj/Weibo_PublicOpinion_AnalysisSystem?style=flat-square)](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/graphs/contributors)
11 [![GitHub License](https://img.shields.io/github/license/666ghj/Weibo_PublicOpinion_AnalysisSystem?style=flat-square)](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/blob/main/LICENSE) 11 [![GitHub License](https://img.shields.io/github/license/666ghj/Weibo_PublicOpinion_AnalysisSystem?style=flat-square)](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/blob/main/LICENSE)
12 12
13 - [English](./README.md) | [中文文档](./README-CN.md)  
14 </div> 13 </div>
15 14
16 -🚀The latest 2.1 version has fully upgraded AI modes———welcome to experience it!⬇️ 15 + ### **[Important Announcement] Refactoring Plan for Weibo_PublicOpinion_AnalysisSystem**
17 16
18 -<div style="display: flex; justify-content: center; align-items: center; gap: 10px; width: 90%; margin: auto;">  
19 - <img src="https://github.com/user-attachments/assets/52d07a31-111a-47e7-86f2-a1bafccfe0a5" style="width: 45%; height: auto;">  
20 - <img src="https://github.com/user-attachments/assets/33a8baf7-e365-4eaf-9620-4f6e9a23fa89" style="width: 45%; height: auto;">  
21 -</div>  
22 -  
23 -**Weibo Public Opinion Analysis and Prediction System** is a **social network public opinion analysis system** designed to monitor, analyze, and predict public opinion trends on social media platforms such as Weibo. This system leverages deep learning, natural language processing (NLP), and machine learning technologies to extract valuable public opinion information from vast amounts of social media data, helping governments, enterprises, and other organizations promptly understand public attitudes, respond to emergencies, and optimize decision-making. 📈  
24 -  
25 -Through powerful data collection and processing capabilities, the Weibo Public Opinion Analysis and Prediction System achieves real-time data collection, sentiment analysis, topic classification, and public opinion prediction, ensuring that users can obtain accurate and comprehensive insights into public opinion in the complex and changing social network environment. The system adopts a modular design, making it easy to maintain and expand, aiming to provide users with an efficient and reliable public opinion analysis tool, assisting various organizations in making informed decisions in the information age.  
26 -  
27 -## ✨ Features  
28 -  
29 -- **Real-time Data Collection**: Utilize web scraping technologies to obtain user-generated content from social platforms like Weibo in real-time.  
30 -- **Data Cleaning and Processing**: Preprocess collected data, including tokenization, removal of stop words, emojis, and URLs.  
31 -- **Topic Classification**: Automatically classify posts and comments into topics using machine learning and natural language processing techniques.  
32 -- **Sentiment Analysis**: Analyze the sentiment orientation (positive, neutral, negative) within texts to understand public emotions.  
33 -- **Public Opinion Monitoring and Prediction**: Monitor changes in public opinion in real-time and predict future trends based on historical data.  
34 -- **Data Visualization**: Display analysis results through charts and graphics for easy understanding and decision-making.  
35 -- **User Management**: Provide user registration, login, and session management features to ensure system security and personalized services.  
36 -  
37 -## 🚀 Getting Started  
38 -  
39 -Follow the steps below to run the project on your system.  
40 -  
41 -### Prerequisites  
42 -  
43 -- [Python](https://www.python.org/) 3.7 or higher  
44 -- [MySQL](https://www.mysql.com/) Database  
45 -- [Conda](https://docs.conda.io/en/latest/) (optional, for environment management)  
46 -- A valid Weibo account (for data collection)  
47 -- At least one of the following API keys for AI analysis features:  
48 - - OpenAI API key  
49 - - Anthropic (Claude) API key  
50 - - DeepSeek API key  
51 -  
52 -### Installation Steps  
53 -  
54 -1. Clone the repository:  
55 - ```bash  
56 - git clone https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem.git  
57 - cd Weibo_PublicOpinion_AnalysisSystem  
58 -  
59 -2. Create and activate a virtual environment (optional):  
60 -  
61 - ```bash  
62 - conda create -n weibo_opinion_analysis python=3.8  
63 - conda activate weibo_opinion_analysis  
64 - ```  
65 -  
66 -3. Install dependencies:  
67 -  
68 - ```bash  
69 - pip install -r requirements.txt  
70 - ```  
71 -  
72 -4. Configure the MySQL database:  
73 -  
74 - - Run `createTables.sql` to create the necessary database tables.  
75 - - Modify the database connection settings in `config.py` to match your MySQL configuration.  
76 -  
77 -5. Configure AI Analysis (Optional):  
78 -  
79 - Set up environment variables for AI analysis features:  
80 - ```bash  
81 - # For OpenAI API (Required for GPT models)  
82 - export OPENAI_API_KEY="your-openai-key"  
83 -  
84 - # For Anthropic API (Required for Claude models)  
85 - export ANTHROPIC_API_KEY="your-anthropic-key"  
86 -  
87 - # For DeepSeek API (Required for DeepSeek models)  
88 - export DEEPSEEK_API_KEY="your-deepseek-key"  
89 - ```  
90 -  
91 - Note: At least one API key must be configured to use AI analysis features.  
92 -  
93 - Supported AI Models:  
94 - - OpenAI: GPT-3.5-Turbo, GPT-4  
95 - - Anthropic: Claude-3 (Opus, Sonnet, Haiku)  
96 - - DeepSeek: DeepSeek-V3 (deepseek-chat), DeepSeek-R1 (deepseek-reasoner)  
97 -  
98 -6. Start the Flask application:  
99 -  
100 - ```bash  
101 - python app.py  
102 - ```  
103 -  
104 -7. Access the application: Open your browser and navigate to http://localhost:5000 to use the system.  
105 -  
106 -## 🛠️ Technology Stack  
107 -  
108 -The Weibo Public Opinion Analysis and Prediction System employs a range of modern technologies to ensure efficiency and scalability:  
109 -  
110 -- **[Flask](https://flask.palletsprojects.com/en/stable/)** - A lightweight web application framework.  
111 -- **[MySQL](https://www.mysql.com/)** - A relational database used to store collected and processed data.  
112 -- **[Scrapy](https://scrapy.org/)** - A powerful web scraping framework used for data collection.  
113 -- **[Jieba](https://github.com/fxsjy/jieba)** - A Chinese text segmentation tool used for text preprocessing.  
114 -- **[SnowNLP](https://github.com/isnowfy/snownlp)** - A Chinese natural language processing library used for sentiment analysis.  
115 -- **[BERT](https://github.com/google-research/bert)** - A pre-trained language model used for topic classification.  
116 -- **[Pandas](https://pandas.pydata.org/)** - A data analysis and manipulation library.  
117 -- **[Matplotlib](https://matplotlib.org/)** - A data visualization library.  
118 -- **[Scikit-learn](https://scikit-learn.org/)** - A machine learning library used for model training and evaluation.  
119 -- **[TensorFlow](https://www.tensorflow.org/)** or **[PyTorch](https://pytorch.org/)** - Deep learning frameworks used for advanced model development.  
120 -- **[OpenAI GPT](https://openai.com/)** - Advanced language models for text analysis.  
121 -- **[Anthropic Claude](https://www.anthropic.com/)** - AI models for sophisticated text analysis.  
122 -- **[DeepSeek](https://deepseek.com/)** - Advanced Chinese-English bilingual AI models.  
123 -  
124 -## 🤝 Contribution  
125 -  
126 -We welcome your contributions! Follow the steps below to participate in the project: 17 +Dear contributors, users, and followers,
127 18
128 -1. Fork this repository.  
129 -2. Create your feature branch (`git checkout -b feature/your-feature`).  
130 -3. Commit your changes (`git commit -m 'Add some feature'`).  
131 -4. Push to the branch (`git push origin feature/your-feature`).  
132 -5. Open a Pull Request. 19 +Hello everyone,
133 20
134 -Please ensure that all tests pass before submitting and follow the project's coding standards. 21 +I am the initiator and main developer of this project. First and foremost, I want to personally thank you for your continued attention, contributions, and enthusiasm for the `Weibo_PublicOpinion_AnalysisSystem` project.
135 22
136 -## 📜 License 23 +Over the past period, as the project has expanded, I have noticed several challenges that require attention:
137 24
138 -This project is licensed under the [GPL-2.0 License](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/blob/main/LICENSE) - see the [LICENSE](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/blob/main/LICENSE) file for details. 25 +1. **Architectural and Module Issues:** Through rapid iteration, many modules have been integrated. However, a lack of unified top-level design has led to some module conflicts and a need for structural optimization.
  26 +2. **High Barrier to Entry:** A significant current challenge is that users need to configure their own crawlers and scrape data from scratch. This makes the deployment and startup process relatively complex, creating an inconvenience for many new users.
  27 +3. **Development and Presentation Limitations:** The development progress of various functional modules has been uneven. Additionally, the existing dashboard paradigm has limitations in compatibility and scalability that hinder my future development goals.
  28 +4. **Constraints of the Self-Trained Model:** Considering its size and maintenance costs, the previously trained model has become a constraint on the project's long-term development.
139 29
140 -## 🌟 Show Your Support 30 +After a careful evaluation of these points, and in light of current technological trends (especially LLMs and agents), I have decided to initiate a **comprehensive, bottom-up architectural refactoring** of the project, with the goal of providing a more user-friendly tool for everyone.
141 31
142 -If you like this project, please give it a star ⭐ on [GitHub](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem)! 32 +**My next update plan will focus on:**
143 33
144 -## 📫 Contact Us 34 +1. **Optimizing the Core Architecture:** I will be moving away from the current dashboard-centric presentation to design a more lightweight and flexible system framework.
  35 +2. **Focusing on Core Competencies:** The new architecture will refocus my efforts on the crawling, processing, and in-depth analysis of Weibo data, aiming to build a stable and efficient data core.
  36 +3. **Integrating Advanced Large Language Models (LLMs):** I plan to discontinue maintenance of the self-trained model and will instead utilize APIs to call mainstream large language models for analysis tasks, enhancing the system's analytical capabilities and flexibility.
  37 +4. **The Ultimate Goal: A New Model of "Deployable Core + Online Service":**
  38 + - **For Developers:** I aim to refine the project into a **"minimal, user-friendly, low-cost, modular"** public opinion analysis **core engine** to facilitate secondary development and private deployment.
  39 + - **For General Users:** Leveraging the new architecture, I **plan to introduce a new "Online Service" version, designed to address the challenges of deployment and data acquisition.**
  40 + - **Providing a Shared Database:** I will begin building and maintaining a **continuously updated, shared database**. This will allow users to access our data source directly, **removing the need to configure and run their own crawlers.**
  41 + - **Simplifying the User Experience:** This will eliminate the need for a complex local setup, enabling a **click-to-use** experience.
  42 + - **Retaining Personalized Analysis:** Users will still be able to configure their own LLM API keys in the online service to perform personalized, in-depth analysis with our data core.
145 43
146 -If you have any questions or suggestions, feel free to contact us through the following methods: 44 +This refactoring is a necessary step in our development. I understand this will require adjusting and, in some cases, rewriting code to which many of you have contributed. However, for the long-term health of the project and to make it accessible to a broader audience, I believe this step is essential.
147 45
148 -- GitHub Issues: [Create a new issue](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/issues)  
149 -- Email: 670939375@qq.com 46 +In the coming weeks, I will begin to outline the new project blueprint and will keep the community updated on my progress. I value your wisdom and support now more than ever.
150 47
151 -## ✨ Contributors 48 +Thank you once again for your understanding and support! Let's look forward to the next evolution of `Weibo_PublicOpinion_AnalysisSystem`.
152 49
153 -Thanks to the following contributors: 50 +Sincerely,
154 51
155 -[![Contributors](https://contrib.rocks/image?repo=666ghj/Weibo_PublicOpinion_AnalysisSystem)](https://github.com/666ghj/Weibo_PublicOpinion_AnalysisSystem/graphs/contributors) 52 +Project Initiator
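
Note on the diff above: the announcement says users will keep supplying their own LLM API keys, and the removed README documented three environment variables for that purpose (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `DEEPSEEK_API_KEY`). A minimal sketch of how a refactored core might detect which provider is configured is shown below. The function names, the provider labels, and the lookup order are hypothetical and chosen only for illustration; nothing here is taken from the project's code, and no provider API is actually called.

```python
import os
from typing import List, Mapping, Optional

# Environment variable names are the ones listed in the removed README.
# The provider labels and their order are assumptions for this sketch.
PROVIDER_ENV_VARS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "deepseek": "DEEPSEEK_API_KEY",
}


def configured_providers(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the providers whose API key is set to a non-empty value."""
    env = os.environ if env is None else env
    return [
        name
        for name, var in PROVIDER_ENV_VARS.items()
        if env.get(var, "").strip()
    ]


def pick_provider(env: Optional[Mapping[str, str]] = None) -> str:
    """Pick the first configured provider, or fail with a clear message."""
    providers = configured_providers(env)
    if not providers:
        expected = ", ".join(PROVIDER_ENV_VARS.values())
        raise RuntimeError(f"No LLM API key configured; set one of: {expected}")
    return providers[0]
```

Passing a plain dictionary instead of reading `os.environ` keeps the check easy to test, for example `pick_provider({"DEEPSEEK_API_KEY": "your-deepseek-key"})`.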