Commit Graph

8 Commits

Author SHA1 Message Date
Houzhong Xu 7ec0a10116
33
Deploy Nomad Configurations / deploy-nomad (push) Failing after 2m49s Details
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 30s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
Simple Test / test (push) Successful in 4s Details
2025-10-18 00:21:33 +00:00
Houzhong Xu 4381428b5d
feat: 重构基础设施配置与安全凭证管理
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 9s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
Simple Test / test (push) Successful in 1s Details
- 新增多个云服务商配置文件(OCI、阿里云)
- 重构Vault、Consul、Nomad等服务的部署配置
- 新增备份与恢复完美状态的脚本
- 更新安全凭证管理文档
- 优化Traefik动态配置
- 删除过时的脚本和配置文件

重构后的配置支持多区域部署,优化了服务发现和负载均衡机制,并完善了安全凭证的备份与恢复流程。
2025-10-13 03:08:22 +00:00
Houzhong Xu 05979bdc03
🔗 Add Grafana route to Traefik
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 7s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
Simple Test / test (push) Successful in 2s Details
 Added Grafana service and router configuration:
- Service: grafana-cluster → http://influxdb.tailnet-68f9.ts.net:3000
- Router: grafana-ui → Host(grafana.git-4ta.live)
- Health check: /api/health endpoint
- SSL: Cloudflare certificate resolver

🌐 Access URL: https://grafana.git-4ta.live
- Redirects to /login (working correctly)
- Full SSL/TLS support via Cloudflare

Deployed and tested successfully 
2025-10-12 09:17:33 +00:00
Houzhong Xu 1eafce7290
🎉 Complete Nomad monitoring infrastructure project
Deploy Nomad Configurations / deploy-nomad (push) Failing after 29s Details
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 11s Details
Simple Test / test (push) Successful in 1s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
 Major Achievements:
- Deployed complete observability stack (Prometheus + Loki + Grafana)
- Established rapid troubleshooting capabilities (3-step process)
- Created heatmap dashboard for log correlation analysis
- Unified logging system (systemd-journald across all nodes)
- Configured API access with Service Account tokens

🧹 Project Cleanup:
- Intelligent cleanup based on Git modification frequency
- Organized files into proper directory structure
- Removed deprecated webhook deployment scripts
- Eliminated 70+ temporary/test files (43% reduction)

📊 Infrastructure Status:
- Prometheus: 13 nodes monitored
- Loki: 12 nodes logging
- Grafana: Heatmap dashboard + API access
- Promtail: Deployed to 12/13 nodes

🚀 Ready for Terraform transition (静默一周后切换)

Project Status: COMPLETED 
2025-10-12 09:15:21 +00:00
Houzhong Xu eff8d3ec6d
REMOVE: 删除不再使用的 Terraform 配置文件
Deploy Nomad Configurations / deploy-nomad (push) Failing after 7m45s Details
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 2m33s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
Simple Test / test (push) Failing after 2m48s Details
- 移除 nomad-terraform.tf 和 test_opentofu_consul.tf 文件
- 更新 Ansible inventory,注释掉不存在的节点 hcp2
- 修改 inventory.ini,确保节点配置的准确性
- 在 nomad-config 模块中添加 null_provider 以支持新配置
- 更新 influxdb1.hcl,添加 Grafana 和 Prometheus 数据卷配置
2025-10-10 13:53:41 +00:00
Houzhong Xu 45f93cc68c
SWITCH: 从 Ansible 切换到 Terraform 管理 Nomad 配置
Infrastructure CI/CD / Validate Infrastructure (push) Failing after 19s Details
Infrastructure CI/CD / Plan Infrastructure (push) Has been skipped Details
Infrastructure CI/CD / Apply Infrastructure (push) Has been skipped Details
Simple Test / test (push) Successful in 5s Details
- 创建 nomad-config Terraform 模块
- 声明式管理 Nomad 节点配置
- 更新 GitOps 工作流使用 Terraform
- 避免配置漂移,确保主客观统一
- 目标: 通过 IaC 修复 5个异常节点
2025-10-09 13:15:57 +00:00
Houzhong Xu edae611b31
Test webhook deployment
- Add OCI credentials to Consul
- Configure OpenTofu plugin cache
- Test GitOps automation
2025-10-09 06:45:42 +00:00
Houzhong Xu 89ee6f7967
Clean repository: organized structure and GitOps setup
- Organized root directory structure
- Moved orphan files to proper locations
- Updated .gitignore to ignore temporary files
- Set up Gitea Runner for GitOps automation
- Fixed Tailscale access issues
- Added workflow for automated Nomad deployment
2025-10-09 06:13:45 +00:00