📢 Gate Square #Creator Campaign Phase 1# is now live – support the launch of the PUMP token sale!
The viral Solana-based project Pump.Fun ($PUMP) is now live on Gate for public sale!
Join the Gate Square Creator Campaign, unleash your content power, and earn rewards!
📅 Campaign Period: July 11, 18:00 – July 15, 22:00 (UTC+8)
🎁 Total Prize Pool: $500 token rewards
✅ Event 1: Create & Post – Win Content Rewards
📅 Timeframe: July 12, 22:00 – July 15, 22:00 (UTC+8)
📌 How to Join:
Post original content about the PUMP project on Gate Square:
Minimum 100 words
Include hashtags: #Creator Campaign
DeepSeek V3 update reshapes the AI development landscape, with Computing Power and Algorithm coexisting to lead a new direction.
DeepSeek V3 Update: Redefining the Direction of AI Development
Recently, DeepSeek released the latest V3 version update, with model parameters reaching 68.5 billion, showing significant improvements in coding capabilities, UI design, and reasoning abilities. This update has sparked heated discussions in the industry regarding the relationship between computing power and algorithms, especially at the recently concluded 2025 GTC conference, where industry insiders emphasized that efficient models will not reduce the demand for chips, and future computing needs will only increase.
The Symbiotic Evolution of Computing Power and Algorithms
In the field of AI, the enhancement of computing power provides a foundation for complex algorithms to run, while the optimization of algorithms can utilize computing power more efficiently. This symbiotic relationship is reshaping the AI industry landscape:
Technical Innovations of DeepSeek
The success of DeepSeek is inseparable from its technological innovations, which are mainly reflected in the following aspects:
Model Architecture Optimization
Using a Transformer + MOE combined architecture, introducing a Multi-Head Latent Attention mechanism (MLA). This architecture acts like a super team, with the Transformer handling regular tasks, the MOE functioning like an expert group addressing specific issues, and the MLA allowing the model to flexibly focus on important details.
Innovative Training Methods
Propose an FP8 mixed precision training framework that dynamically selects computational precision based on training requirements, improving training speed and reducing memory usage while ensuring accuracy.
Improvement in inference efficiency
Introducing Multi-Token Prediction (MTP) technology, which predicts multiple tokens at once, significantly improving inference speed and reducing costs.
Breakthrough in Reinforcement Learning Algorithms
The new GRPO algorithm optimizes the model training process, achieving a balance between performance enhancement and cost reduction by minimizing unnecessary computations.
These innovations have formed a complete technical system that reduces computing power requirements across the entire chain from training to inference, allowing ordinary consumer-grade graphics cards to run powerful AI models, significantly lowering the barriers to AI applications.
Impact on Chip Manufacturers
DeepSeek optimizes algorithms through the PTX layer, which has a dual impact on chip manufacturers: on one hand, it deepens the binding with hardware and the ecosystem, potentially expanding the overall market size; on the other hand, algorithm optimization may change the market demand structure for high-end chips.
Significance for China's AI Industry
DeepSeek's algorithm optimization provides a technological breakthrough path for China's AI industry. Against the backdrop of restrictions on high-end chips, the idea of "software complementing hardware" reduces dependence on top imported chips. Upstream computing power service providers can extend the hardware usage cycle through software optimization, while downstream lowers the threshold for AI application development, giving rise to more AI solutions in vertical fields.
The Profound Impact of Web3 + AI
Decentralized AI Infrastructure
DeepSeek's innovations enable decentralized AI inference. The MoE architecture is suitable for distributed deployment, and the FP8 training framework reduces the demand for high-end computing resources, allowing more computing resources to join the node network.
Multi-Agent Systems
DeepSeek seeks breakthroughs through algorithmic innovation, opening up differentiated development paths for the AI industry. The future development of AI will be a competition of collaborative optimization between computing power and algorithms, with innovators redefining the rules of the game with new ideas.