{"payload":{"feedbackUrl":"https://github.com/orgs/community/discussions/53140","repo":{"id":694465481,"defaultBranch":"main","name":"stat_py","ownerLogin":"elkronos","currentUserCanPush":false,"isFork":false,"isEmpty":false,"createdAt":"2023-09-21T03:50:46.000Z","ownerAvatar":"https://avatars.githubusercontent.com/u/130187349?v=4","public":true,"private":false,"isOrgOwned":false},"refInfo":{"name":"","listCacheKey":"v0:1695268247.0","currentOid":""},"activityList":{"items":[{"before":"623379f8bd5ece81772e9e0197a7151043408e78","after":"5ae1c6ceecba1df0a934b89e6ebe50f856ca48d8","ref":"refs/heads/main","pushedAt":"2024-08-07T01:26:32.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"fixes and adjustments\n\nClass Implementation:\nRefactored various functions into a static methods within the AssumptionChecks class to centralize the checks for linear regression assumptions.\n\nFunction Refactoring:\nMerged the check_linearity, check_normality, check_homoscedasticity, check_independence, and check_multicollinearity functions into corresponding static methods in the AssumptionChecks class.\n\nImproved Error Handling:\nAdded try-except blocks around assumption checks and plotting to handle and report errors gracefully during the execution of linear_regression_and_check_assumptions.\n\nDocumentation and Comments:\nUpdated docstrings and comments to provide clearer descriptions of each function’s purpose and parameters.\n\nFunction Integration:\nIntegrated individual checks into the linear_regression_and_check_assumptions function, ensuring all assumption checks are performed and errors are logged if they occur.\n\nPlotting Enhancements:\nEnhanced the plotting of regression models and assumption checks by ensuring plots have appropriate titles and annotations.\n\nCode Cleanup:\nRemoved redundant code and streamlined the logic for better readability and maintainability.","shortMessageHtmlLink":"fixes and adjustments"}},{"before":"b263dca0e04bcaac686097fd315caba96dcc4b9a","after":"623379f8bd5ece81772e9e0197a7151043408e78","ref":"refs/heads/main","pushedAt":"2024-08-07T01:21:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update xgboost.py\n\nLogging and Configuration:\nIntroduced logging setup for detailed information on the script’s execution.\nAdded a configuration loading function to read parameters from a YAML file, falling back to default settings if the file is not found.\n\nRandom Seed:\nUsed a configuration-defined seed for reproducibility, updating the seed initialization.\n\nData Loading and Preprocessing:\nStandardized the feature scaling process and added an option to normalize the target variable based on the configuration.\nEnsured exceptions during data loading are logged and raised.\n\nDataset Splitting:\nModified the random state for splitting datasets to ensure different splits for training, validation, and test sets.\n\nHyperparameter Tuning:\nSimplified hyperparameter configuration by using values from the loaded configuration.\nIntroduced KFold for cross-validation during hyperparameter tuning.\n\nModel Training and Evaluation:\nRefined the model training function to include cross-validation scores, training time, and additional evaluation metrics.\nAdded error handling and logging for model training and evaluation.\nImproved SHAP plot generation, saving it to a file.\n\nMain Function:\nCreated a main function to encapsulate the execution flow, providing detailed print statements and logging at each step.\nOrganized the process from loading data, splitting datasets, tuning hyperparameters, training the model, and saving results.\n\nSerialization:\nAdded model and scaler serialization for deployment purposes.\nEnhanced the script’s structure and maintainability with clear separation of tasks and better error handling.","shortMessageHtmlLink":"Update xgboost.py"}},{"before":"05ca3e36620b3af486f225f3b07a639db94eb076","after":"b263dca0e04bcaac686097fd315caba96dcc4b9a","ref":"refs/heads/main","pushedAt":"2024-08-03T20:36:11.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"fixes and updates\n\nexponential_smoother:\nRemoved an unused import statement (matplotlib.pyplot as plt).\nFixed a typo in a return statement: replaced a complex formula with an array (errors) for better readability and functionality.\nChanged the method of generating alphas to np.linspace for evenly spaced values, ensuring they avoid both zero and one.\nModified the return value of exponential_smoother to include only future forecasts instead of all forecasts.\n\nrobust_anova:\nRemoved redundant import statement (from statsmodels.formula.api import ols).\nCorrected the group assignment line in two_way_anova to use the correct column name (response to formula.split('~')[0].strip()).\nModified the repeated_measures_anova function to dynamically extract the dependent variable and within-factors from the formula.\nRemoved multiple example data generation and function calls, leaving only the Mann-Whitney U Test example for clarity.\nFixed indentation and minor formatting issues in the robust_anova function.\n\nt-test:\nAdded an optional significance_level parameter to the check_normality and check_variance functions, replacing the hardcoded constant SIGNIFICANCE_LEVEL.\nUpdated log messages in check_normality and check_variance to include p-values for better clarity.\nAdded a significance_level parameter to the perform_ttest and independent_ttest functions and propagated its use in internal checks.\nEnhanced the plot_means function by specifying the figure size and adding error bar caps for better visualization.\nRemoved redundant constants and streamlined logging configurations.","shortMessageHtmlLink":"fixes and updates"}},{"before":"fffb7da39247b9e5ec41eeeca2a94e2a8b8f950b","after":"05ca3e36620b3af486f225f3b07a639db94eb076","ref":"refs/heads/main","pushedAt":"2024-08-01T04:21:23.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Enhancements for polychoric correlation\n\nFunction Enhancement:\n\nAdded validate_start_parameters function to cleanly validate and process the start parameters.\nImproved log_likelihood function to handle parameter clipping and restructuring of row and column cuts more efficiently.\nIntroduced preprocess_data function to handle contingency table creation.\nParameter and Thresholds Handling:\n\nEnhanced start parameter validation to check the type and length of rho, row_thresholds, and col_thresholds.\nUpdated the handling of maximum likelihood (ML) and standard error (std_err) calculation with more robust start values and initial guess processing.\nPerformance and Robustness:\n\nImproved handling of edge cases where tables have fewer than 2 rows or columns.\nAdded informative print statements to notify when zero marginal rows or columns are removed.\nUnit Testing:\n\nExpanded unit tests to include edge cases for few rows and columns, invalid start values, and performance testing on large inputs.\nVerified the inclusion of thresholds in the output when requested.\nDocumentation:\n\nAdded comprehensive docstrings to new helper functions and updated existing ones for clarity and completeness.\nDetailed Code Changes:\npolychor Function:\n\nAdded nested functions: validate_start_parameters, log_likelihood, and preprocess_data.\nUpdated main logic to utilize these nested functions for improved readability and maintenance.\nRefined parameter validation and error handling mechanisms.\nEnhanced maximum likelihood estimation with better initial guess handling.\nbinBvn Function:\n\nNo significant changes to logic but included better threshold handling.\nUnit Tests:\n\nAdded TestPolychor class with new tests for various scenarios and edge cases.\nEnsured tests cover normal operations, edge cases, and performance checks.","shortMessageHtmlLink":"Enhancements for polychoric correlation"}},{"before":"d0f5ee3c2cc4506813b513b75c60ea2cdda519ba","after":"fffb7da39247b9e5ec41eeeca2a94e2a8b8f950b","ref":"refs/heads/main","pushedAt":"2024-07-04T07:25:39.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Rename PolychoricCorrelation.py to polychoric_correlation.py","shortMessageHtmlLink":"Rename PolychoricCorrelation.py to polychoric_correlation.py"}},{"before":"334dd85f2ccabc3807883a6e1ca6cfafb947a4bc","after":"d0f5ee3c2cc4506813b513b75c60ea2cdda519ba","ref":"refs/heads/main","pushedAt":"2024-07-04T07:25:23.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Rename MNISTClassifierWithEarlyStopping.py to mnist_classifier_early_stopping.py","shortMessageHtmlLink":"Rename MNISTClassifierWithEarlyStopping.py to mnist_classifier_early_…"}},{"before":"85930cf0a1137002add61b238f2b4560d8d0f997","after":"334dd85f2ccabc3807883a6e1ca6cfafb947a4bc","ref":"refs/heads/main","pushedAt":"2024-07-04T07:24:16.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update README.md\n\nChanged order","shortMessageHtmlLink":"Update README.md"}},{"before":"95257ae21a3e780ab5300f600f957845ff860601","after":"85930cf0a1137002add61b238f2b4560d8d0f997","ref":"refs/heads/main","pushedAt":"2024-07-04T07:22:49.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Added outlier detection\n\nAdded outlier detection and visualization script\n\n- Implemented multiple outlier detection methods: Z-score, Tukey's Fences, Mahalanobis Distance, Grubbs' Test, Median Absolute Deviation (MAD), Iglewicz and Hoaglin's Modified Z-score, Isolation Forest, DBSCAN, One-Class SVM, Elliptic Envelope, and Local Outlier Factor (LOF).\n- Included utility functions for validating parameters, detecting outliers, and visualizing results.\n- Enhanced visualizations with bar plots and scatter plots.","shortMessageHtmlLink":"Added outlier detection"}},{"before":"200ba46f57ff3b3ad9d0cb46ad4883c7ee4370a7","after":"95257ae21a3e780ab5300f600f957845ff860601","ref":"refs/heads/main","pushedAt":"2024-07-04T07:21:25.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update README.md\n\nAdded new function description for outlier detector. Also adjusted order.","shortMessageHtmlLink":"Update README.md"}},{"before":"945dea359dc5deca42f887891cb6da33faf2b218","after":"200ba46f57ff3b3ad9d0cb46ad4883c7ee4370a7","ref":"refs/heads/main","pushedAt":"2024-06-05T04:18:42.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update README.md\n\nUpdated to include exponential smoother function.","shortMessageHtmlLink":"Update README.md"}},{"before":"181b1ae2f6be04a498ded695e1430c3aaaee562f","after":"945dea359dc5deca42f887891cb6da33faf2b218","ref":"refs/heads/main","pushedAt":"2024-06-05T04:12:12.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Exponential smoother\n\nFunction to forecast using an exp smooth algo.","shortMessageHtmlLink":"Exponential smoother"}},{"before":"b57516310ee646299b12f1f50443a43c619da77b","after":"181b1ae2f6be04a498ded695e1430c3aaaee562f","ref":"refs/heads/main","pushedAt":"2024-01-03T04:38:23.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Merge branch 'main' of https://github.com/elkronos/stat_py","shortMessageHtmlLink":"Merge branch 'main' of https://github.com/elkronos/stat_py"}},{"before":"48eefb4559bc583f964905c160286e16dec0e9fe","after":"b57516310ee646299b12f1f50443a43c619da77b","ref":"refs/heads/main","pushedAt":"2024-01-03T04:37:28.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update README.md","shortMessageHtmlLink":"Update README.md"}},{"before":"40d7ae2fd4923419ae55d15c26e1f946ceb1178b","after":"48eefb4559bc583f964905c160286e16dec0e9fe","ref":"refs/heads/main","pushedAt":"2023-12-05T07:21:16.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"xgboost","shortMessageHtmlLink":"xgboost"}},{"before":"9734d347d3901b91f8849a859373353adbb5eeec","after":"40d7ae2fd4923419ae55d15c26e1f946ceb1178b","ref":"refs/heads/main","pushedAt":"2023-11-25T22:33:37.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Script additions\n\nAdding scripts to demonstrate a neural network, and polychoric correlations in python.","shortMessageHtmlLink":"Script additions"}},{"before":"b49ee8fb6aff0975d7841dabfe8e8a4d1115f1f6","after":"9734d347d3901b91f8849a859373353adbb5eeec","ref":"refs/heads/main","pushedAt":"2023-09-22T01:36:57.000Z","pushType":"push","commitsCount":2,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Merge branch 'main' of https://github.com/elkronos/stat_py","shortMessageHtmlLink":"Merge branch 'main' of https://github.com/elkronos/stat_py"}},{"before":"a109b779bf34b256e89da433ba7b6e5310f85a0d","after":"b49ee8fb6aff0975d7841dabfe8e8a4d1115f1f6","ref":"refs/heads/main","pushedAt":"2023-09-22T01:35:54.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Update README.md\n\nAdded description for t-test function","shortMessageHtmlLink":"Update README.md"}},{"before":"32697bcd5190bf3ab75bde16e96c04d56d143d37","after":"a109b779bf34b256e89da433ba7b6e5310f85a0d","ref":"refs/heads/main","pushedAt":"2023-09-21T03:54:25.000Z","pushType":"push","commitsCount":1,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Create README.md\n\nAdded description for repo and linear regression and assumptions checker.","shortMessageHtmlLink":"Create README.md"}},{"before":null,"after":"32697bcd5190bf3ab75bde16e96c04d56d143d37","ref":"refs/heads/main","pushedAt":"2023-09-21T03:50:47.000Z","pushType":"branch_creation","commitsCount":0,"pusher":{"login":"elkronos","name":null,"path":"/elkronos","primaryAvatarUrl":"https://avatars.githubusercontent.com/u/130187349?s=80&v=4"},"commit":{"message":"Initial commit","shortMessageHtmlLink":"Initial commit"}}],"hasNextPage":false,"hasPreviousPage":false,"activityType":"all","actor":null,"timePeriod":"all","sort":"DESC","perPage":30,"cursor":"djE6ks8AAAAEk6pBiAA","startCursor":null,"endCursor":null}},"title":"Activity · elkronos/stat_py"}