Houzhong Xu | 1eafce7290 | 🎉 Complete Nomad monitoring infrastructure project

Deploy Nomad Configurations / deploy-nomad (push) Failing after 29s
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 11s
Simple Test / test (push) Successful in 1s
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped

✅ Major Achievements:
- Deployed complete observability stack (Prometheus + Loki + Grafana)
- Established rapid troubleshooting capabilities (3-step process)
- Created heatmap dashboard for log correlation analysis
- Unified logging system (systemd-journald across all nodes)
- Configured API access with Service Account tokens
🧹 Project Cleanup:
- Intelligent cleanup based on Git modification frequency
- Organized files into proper directory structure
- Removed deprecated webhook deployment scripts
- Eliminated 70+ temporary/test files (43% reduction)
📊 Infrastructure Status:
- Prometheus: 13 nodes monitored
- Loki: 12 nodes logging
- Grafana: Heatmap dashboard + API access
- Promtail: Deployed to 12/13 nodes
🚀 Ready for Terraform transition (switch over after a one-week quiet period)
Project Status: COMPLETED ✅

2025-10-12 09:15:21 +00:00

Houzhong Xu | eff8d3ec6d | REMOVE: Delete Terraform configuration files that are no longer used

Deploy Nomad Configurations / deploy-nomad (push) Failing after 7m45s
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 2m33s
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped
Simple Test / test (push) Failing after 2m48s

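- Removed the nomad-terraform.tf and test_opentofu_consul.tf files
- Updated the Ansible inventory, commenting out the nonexistent node hcp2
- Modified inventory.ini to ensure the node configuration is accurate
- Added null_provider to the nomad-config module to support the new configuration
- Updated influxdb1.hcl, adding Grafana and Prometheus data volume configuration

2025-10-10 13:53:41 +00:00
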
Houzhong Xu | 09dca62603 | FIX: Unify the Ansible inventory and create a Nomad node repair playbook

Simple Test / test (push) Successful in 6s

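- Standardized on the ben/3131 credentials
- Removed the duplicate pve inventory
- Created fix-nomad-nodes.yml to repair the faulty nodes
- Created a Nomad client template based on warden's working configuration
- Repair targets: ch4, hcp1, warden, ash1d (ash2e: connection timed out)

2025-10-09 13:03:03 +00:00
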
Houzhong Xu | 89ee6f7967 | Clean repository: organized structure and GitOps setup

- Organized root directory structure
- Moved orphan files to proper locations
- Updated .gitignore to ignore temporary files
- Set up Gitea Runner for GitOps automation
- Fixed Tailscale access issues
- Added workflow for automated Nomad deployment

2025-10-09 06:13:45 +00:00