Decoding Protein Misfolding at Scale
Advanced computational platform for protein structure analysis, misfolding prediction, and precision therapeutic intervention in neurodegenerative disorders
Features β’ Demo β’ Installation β’ Usage β’ Technology β’ Contributing
- Overview
- Authors
- Features
- Demo
- Technology Stack
- Installation
- Usage
- Project Structure
- API Documentation
- Screenshots
- Roadmap
- Contributing
- License
- Acknowledgments
Proteomorphic is a cutting-edge web-based platform that leverages artificial intelligence and computational biology to analyze protein structures, predict misfolding risks, and design therapeutic interventions for neurodegenerative diseases such as Alzheimer's, Parkinson's, and Huntington's disease.
Protein misfolding is a critical factor in numerous neurodegenerative disorders. Traditional analysis methods are time-consuming, expensive, and require specialized expertise. There is a pressing need for accessible, automated tools that can:
- Rapidly analyze protein structures
- Identify misfolding hotspots
- Predict disease risk
- Design targeted therapeutic interventions
Proteomorphic provides an end-to-end automated pipeline that:
- Analyzes protein sequences using AI-powered structure prediction (ESM-2)
- Identifies misfolding hotspots and pathogenic mutations
- Calculates misfolding risk scores based on biochemical properties
- Designs CRISPR guide RNAs for therapeutic intervention
- Generates comprehensive clinical reports with 3D visualizations
- Monitors patient biomarkers through IoT integration
This project was developed by:
| Author | GitHub | Role |
|---|---|---|
| bmsahana14 | @bmsahana14 | Lead Developer |
| 02pasco | @02pasco | Core Developer |
| 007sanju | @007sanju | Core Developer |
| 1ep22ic003 | @1ep22ic003 | Core Developer |
- 3D Structure Visualization: Interactive protein structure viewer using 3Dmol.js
- Secondary Structure Prediction: Alpha helix, beta sheet, and random coil composition
- AI-Powered Analysis: ESM-2 protein language model for advanced predictions
- Biochemical Properties: Instability index, hydrophobicity (GRAVY), and charge analysis
- Hotspot Identification: Detects aggregation-prone regions and pathogenic mutations
- Risk Assessment: Calculates misfolding risk scores (0-100) with confidence levels
- Disease Association: Maps proteins to known neurodegenerative disorders
- Clinical Interpretation: Generates actionable clinical insights
- Automated Guide RNA Design: Generates optimized 20bp guide sequences
- PAM Site Identification: NGG motif detection for SpCas9
- Efficiency Scoring: Predicts editing efficiency based on GC content
- Off-Target Analysis: Evaluates specificity and potential off-target effects
- Delivery System Recommendations: AAV9-PHP.eB for CNS, LNP for systemic delivery
- Real-Time Monitoring: Live patient biomarker tracking
- Multi-Parameter Analysis: Temperature, heart rate, protein levels, oxidative stress
- Risk Correlation: Links biosensor data to misfolding risk
- Alert System: Automated notifications for critical values
- Clinical Reports: Detailed analysis with 3D visualizations
- Export Functionality: PDF generation for medical records
- Historical Tracking: Patient analysis history and trends
- Dashboard Analytics: Visual insights and statistics
- User Authentication: Secure login with Supabase integration
- Role-Based Access: Admin, researcher, and patient roles
- Data Privacy: HIPAA/GDPR compliant architecture
- Encrypted Storage: Secure data handling and transmission
Protein Input β Structure Analysis β Risk Assessment β CRISPR Design β Report Generation
𧬠π β οΈ βοΈ π
Input: Alpha-Synuclein (SNCA) - Parkinson's Disease Protein
Output:
- Misfolding Risk: 87/100 (High Risk)
- Hotspots Detected: A53T, A30P mutations
- CRISPR Guides: 3 optimized guide RNAs designed
- Clinical Recommendation: Therapeutic intervention recommended
- HTML5/CSS3: Modern, responsive UI with glassmorphism design
- JavaScript (ES6+): Dynamic interactions and API integration
- 3Dmol.js: Interactive 3D protein structure visualization
- Google Fonts (Inter): Premium typography
- Python 3.8+: Core analysis engine
- Flask 3.0.0: RESTful API framework
- Flask-CORS: Cross-origin resource sharing
- ESM-2 (650M): Facebook's protein language model
- PyTorch: Deep learning framework
- Transformers: Hugging Face model integration
- NumPy/Pandas: Scientific computing
- Supabase: PostgreSQL database and authentication
- LocalStorage: Client-side data persistence
- UniProt: Protein sequence retrieval
- NCBI: Genetic variant data
- AlphaFold: Structure prediction integration
- Python 3.8 or higher
- Node.js (optional, for development tools)
- Modern web browser (Chrome, Firefox, Edge)
- Git
git clone https://github.com/YOUR_USERNAME/proteomorphic.git
cd proteomorphic# Navigate to backend directory
cd backend
# Create virtual environment
python -m venv venv
# Activate virtual environment
# Windows:
venv\Scripts\activate
# macOS/Linux:
source venv/bin/activate
# Install dependencies
pip install -r requirements.txtCreate a .env file in the backend directory:
FLASK_APP=app.py
FLASK_ENV=development
SECRET_KEY=your_secret_key_here
SUPABASE_URL=your_supabase_url
SUPABASE_KEY=your_supabase_anon_keypython app.pyThe backend will start at http://localhost:5000
# Navigate back to project root
cd ..
# Option 1: Use Python's built-in server
python -m http.server 8000
# Option 2: Use VS Code Live Server extension
# Right-click index.html β "Open with Live Server"The frontend will be available at http://localhost:8000
Open your browser and navigate to:
- Frontend:
http://localhost:8000 - Backend API:
http://localhost:5000/api/health
- Sign Up: Create an account at
/auth/signup.html - Login: Access your dashboard at
/auth/login.html - Analyze Protein:
- Navigate to Analysis β Input
- Enter protein name (e.g., "Alpha-Synuclein")
- Click "Analyze Protein"
- View Report: Review comprehensive analysis with 3D visualization
- Monitor IoT: Check real-time biosensor data (if available)
// Analyze a protein
const response = await fetch('http://localhost:5000/api/analyze', {
method: 'POST',
headers: {
'Content-Type': 'application/json'
},
body: JSON.stringify({
proteinName: 'Alpha-Synuclein',
proteinSequence: 'MDVFMKGLSKAKEGVVAAAEKTKQGVAEAAGKTKEGVLYVGSKTKEGVVHGVATVAEKTKEQVTNVGGAVVTGVTAVAQKTVEGAGSIAAATGFVKKDQLGKNEEGAPQEGILEDMPVDPDNEAYEMPSEEGYQDYEPEA'
})
});
const result = await response.json();
console.log(result);{
"proteinId": "Alpha-Synuclein",
"proteinName": "Alpha-Synuclein",
"misfoldingRisk": 87,
"riskLevel": "High Risk",
"confidence": 94,
"structure": {
"alphaHelix": 35.2,
"betaSheet": 28.7,
"randomCoil": 36.1,
"instabilityIndex": 42.3,
"gravy": -0.234
},
"hotspots": [
{
"residue": "A53T",
"position": 53,
"severity": "high",
"confidence": 0.94,
"impact": "Accelerates fibril formation, Parkinson's disease"
}
],
"crisprDesign": {
"gene": "SNCA",
"guideRNAs": [...],
"deliverySystem": "AAV9-PHP.eB",
"successProbability": 89.5
}
}proteomorphic/
βββ index.html # Landing page
βββ logo.png # Project logo
βββ system_architecture.png # Architecture diagram
βββ .gitignore # Git ignore rules
β
βββ auth/ # Authentication pages
β βββ login.html
β βββ signup.html
β βββ verify.html
β
βββ dashboard/ # User dashboard
β βββ index.html
β
βββ analysis/ # Protein analysis
β βββ input.html
β
βββ report/ # Analysis reports
β βββ view.html
β
βββ css/ # Stylesheets
β βββ variables.css # CSS variables
β βββ global.css # Global styles
β βββ components.css # Component styles
β
βββ js/ # JavaScript modules
β βββ auth.js # Authentication logic
β βββ utils.js # Utility functions
β βββ chatbot.js # AI chatbot
β βββ report-data-loader.js # Report data handling
β βββ notifications.js # Notification system
β
βββ backend/ # Python backend
β βββ app.py # Flask API server
β βββ requirements.txt # Python dependencies
β βββ .env # Environment variables
β
βββ images/ # Image assets
β βββ [various images]
β
βββ iot-monitoring.html # IoT dashboard
βββ knowledge.html # Knowledge center
βββ chatbot-widget.html # Chatbot interface
βββ admin.html # Admin panel
Analyze a protein sequence and predict misfolding risk.
Request Body:
{
"proteinName": "string",
"proteinSequence": "string (optional)"
}Response: Complete analysis object with structure, hotspots, and CRISPR design.
Check backend health status.
Response:
{
"status": "healthy",
"model_loaded": true,
"device": "cpu"
}- β Protein structure analysis
- β Misfolding risk prediction
- β CRISPR guide RNA design
- β IoT biosensor integration
- β User authentication
- AlphaFold integration for structure prediction
- Multi-protein complex analysis
- Drug-protein interaction modeling
- Mobile application (iOS/Android)
- Cloud deployment (AWS/Azure)
- Real-time collaborative analysis
- Machine learning model training interface
- Integration with electronic health records (EHR)
- Advanced visualization (VR/AR)
We welcome contributions from the community! Here's how you can help:
- Fork the repository
- Create a feature branch:
git checkout -b feature/AmazingFeature - Commit your changes:
git commit -m 'Add some AmazingFeature' - Push to the branch:
git push origin feature/AmazingFeature - Open a Pull Request
- Follow PEP 8 for Python code
- Use ESLint for JavaScript
- Write clear commit messages
- Add tests for new features
- Update documentation
- π Bug fixes
- β¨ New features
- π Documentation improvements
- π¨ UI/UX enhancements
- π§ͺ Test coverage
- π Internationalization
This project is licensed under the MIT License - see the LICENSE file for details.
MIT License
Copyright (c) 2025 Proteomorphic Team
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
- ESM-2: Meta AI's protein language model
- 3Dmol.js: 3D molecular visualization library
- Flask: Python web framework
- Supabase: Backend-as-a-Service platform
- UniProt: Protein sequence database
- AlphaFold: Structure prediction (planned integration)
- Protein Data Bank (PDB)
- National Center for Biotechnology Information (NCBI)
- Neurodegenerative disease research community
- Open-source bioinformatics community
- All contributors and testers
- Academic advisors and mentors
- Open-source community
- GitHub Issues: Report bugs or request features
- Email: proteomorphic@example.com
- Documentation: Full documentation
If you find Proteomorphic useful, please consider:
- β Starring the repository
- π Reporting bugs
- π‘ Suggesting new features
- π’ Sharing with colleagues
Made with β€οΈ for the advancement of computational biology and neurodegenerative disease research




