New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add postgresql analyzer #1084

Open

roshanmaskey wants to merge 4 commits into google:master from roshanmaskey:add-postgresql-analyzer

Contributor

roshanmaskey commented Jul 18, 2022

Postgres analyzer that analyzes postgresql installation and provides a TLDR and details findings report.

Postgresql configuration - postgresql.conf
Client authentication - pg_hba.conf
Linux postgres account
Postgres account bash history and psql history
Postgresql logs

roshanmaskey and others added 4 commits

July 13, 2022 03:28


          Added extract_custom_artifacts function in utils.py

f607531


          Initial commit PostgreSQLAnalysisTask

0cf7d4e


          Updated _analyze_postgresql_config

d07e965


          Merge branch 'master' into add-postgresql-analyzer

423512e

aarontp reviewed

View reviewed changes

Member

aarontp left a comment

Wow, this is really comprehensive, thanks!

Here are some initial comments, though I haven't gone through the entire PR yet so feel free to ignore these comments until I do a second pass tomorrow. Mostly just some small code style kinds of nits.

FYI, I fixed the recent merge conflict too.

turbinia/workers/analysis/postgresql.py

+              _BASH_HISTORY = '.bash_history'
+              _PSQL_HISTORY = '.psql_history'
+              _POSTGRESQL_ARTIFACTS = """---

Member

aarontp Jul 20, 2022

I think this is good here for now, but should we upstream this Artifact into the main forensics artifact repo?

Member

aarontp Jul 20, 2022

I just realized that these artifacts are already in the ForensicArtifacts repo, or at least ones with the same name are there. Is there anything custom about these? If not, we can probably just use those by name with extract_artifacts() instead of re-defining it here.

turbinia/workers/analysis/postgresql.py

+                  # Module: PostgreSQL Configuration Analysis
+                  # 1.1. find postgresql.conf and copy to artifact directory
+                  try:
+                    artifact_locations, err = self._collect_artifact(

Member

aarontp Jul 20, 2022

Looks like you're probably coming from golang based on the patterns here :).

I don't actually see err being returned with a value other than None in the method being called here, though there are other places that TurbiniaException is being raised. Rather than duplicating the error handling, and also to match style with the rest of the codebase (as well as be more idiomatic python), should we change these to just return the artifact locations and keep the try/except block error handling for these calls?

Contributor Author

roshanmaskey Jul 20, 2022

Ack the recommendation. Removing unsued err return from code.

turbinia/workers/analysis/postgresql.py

+                      return result
+                    if not artifact_locations:
+                      result.close(self, success=False,

Member

aarontp Jul 20, 2022

There are a few locations that don't quite match the style guide and will probably fail the yapf unit test (e.g. normally the newline comes after the method name and paren here). Do you want to run yapf on the codebase to auto-fix some of these? If you installed the dev dependencies (pip install -e .[dev]) you should have yapf installed and can do something like yapf -i -r --style .style.yapf ./turbinia/.

Contributor Author

roshanmaskey Jul 20, 2022

I will run yapf and fix the style.

turbinia/workers/analysis/postgresql.py


		return final_report, final_priority, final_summary

		def read_file(filepath):

Member

aarontp Jul 20, 2022

We also have a file_to_str util method for this that checks for a couple error conditions (though I'm not sure why we don't raise TurbiniaExceptions there for errors). Could we potentially use that or check for errors here?

turbinia/workers/analysis/postgresql.py

+                  return artifacts, None
+                def _get_artifact_disk_path(self, collected_artifact_path):
+                  """Returns the artifact disk path.

Member

aarontp Jul 20, 2022

Maybe just to be a bit clearer here, this could be something like "Returns the absolute path of the artifact on the original disk without the local mount prefix" or similar? That probably won't fit on one line though, so maybe add it as a second line or something?

Contributor Author

roshanmaskey Jul 20, 2022

Done - Updated with a more clear description.

turbinia/workers/analysis/postgresql.py

+                    (report, priority, summary), err = self._analyze_postgresql_config(
+                        config_data)
+                    if err:

Member

aarontp Jul 20, 2022

See comments below, but we might be able to remove this.

Contributor Author

roshanmaskey Jul 21, 2022

Fixed.

turbinia/workers/analysis/postgresql.py

+                    if not artifact_locations:
+                      result.close(self, success=False,
+                          status='Error setting artifact location')

Member

aarontp Jul 20, 2022

Should this be "No pg_hba.conf found" as well? If not, maybe we can clarify what this means a bit?

Contributor Author

roshanmaskey Jul 21, 2022

Replaced with "pg_hba.conf not found"

turbinia/workers/analysis/postgresql.py

+                    for artifact_location in artifact_locations:
+                      # we only want to process /etc/passwd
+                      if '/etc/passwd' not in artifact_location:
+                        result.log(f'Ignore passwd file {artifact_location}')

Member

aarontp Jul 20, 2022

Maybe to be explicit here you could say "Ignoring filename {} not matching '/etc/passwd'"?

Contributor Author

roshanmaskey Jul 20, 2022

Done

turbinia/workers/analysis/postgresql.py

+                        result.log(f'Ignore passwd file {artifact_location}')
+                        continue
+                      result.log(f'Processing passwd: {artifact_location}')

Member

aarontp Jul 20, 2022

s/passwd/passwd file/

Contributor Author

roshanmaskey Jul 20, 2022

Done

turbinia/workers/analysis/postgresql.py

+                  # Module: User Bash History Analysis
+                  # It includes all user bash history including postgres user account.
+                  # 4. Find and analyze .bash_history
+                  artifact_locations, err = self._collect_artifact(_BASH_HISTORY, evidence)

Member

aarontp Jul 20, 2022

Should we just use the already existing BashShellHistoryFile artifact for this? That should also handle the directory so we don't need to search the whole disk for it as well (though we should make sure that it actually works with a mounted disk due to the mount prefix).

aarontp reviewed

View reviewed changes

Member

aarontp left a comment

LG, just some more small comments.

turbinia/workers/analysis/postgresql.py Show resolved Hide resolved

turbinia/workers/analysis/postgresql.py Show resolved Hide resolved

turbinia/workers/analysis/postgresql.py

+                          summary=summary, report=report))
+                  # Module: PostgreSQL Database User Analysis
+                  # 6. Find and analyze PostgreSQL database user analysis.

Member

aarontp Jul 20, 2022

small nit: s/user analysis/users/

Member

aarontp Jul 20, 2022

Actually, should we just remove this whole comment block? It looks like it might be for functionality that was removed?

Contributor Author

roshanmaskey Jul 21, 2022

Done.

turbinia/workers/analysis/postgresql.py

+                  with open(artifact_definition_file, 'wb') as fh:
+                    fh.write(_POSTGRESQL_ARTIFACTS.encode('utf8'))
+                  artifact_names = ['PostgreSQLLogFiles']

Member

aarontp Jul 20, 2022

See comment above, but unless this artifact actually has some differences from the one in the ForensicArtifacts repo, we should be able to just use the extract_artifacts() method and extract it by name instead of writing out the yaml file.

turbinia/workers/analysis/postgresql.py

+              _BASH_HISTORY = '.bash_history'
+              _PSQL_HISTORY = '.psql_history'
+              _POSTGRESQL_ARTIFACTS = """---

Member

aarontp Jul 20, 2022

I just realized that these artifacts are already in the ForensicArtifacts repo, or at least ones with the same name are there. Is there anything custom about these? If not, we can probably just use those by name with extract_artifacts() instead of re-defining it here.

turbinia/workers/analysis/postgresql.py

+                    for line in re.findall(pattern, data):
+                      if module_priority > event_priority:
+                        module_priority = event_priority
+                        module_summary = f'{name} detected'

Member

aarontp Jul 20, 2022

Do we want to do something here similar to one of the other analyzer methods here that counts the number of findings and adds that as a summary instead of just using the last one found?

turbinia/workers/analysis/postgresql.py

+                  report.insert(4, '{0:s}\n'.format(fmt.heading2('Detailed Analysis')))
+                  final_report = '\n'.join(report)
+                  final_summary = '\n'.join(x[1] for x in summary)

Member

aarontp Jul 20, 2022

The value that gets used as the final status as part of the result (as set in result.close()) should be a single line so that it fits into the final report for all Tasks. Should we change this to be something like N findings reported or similar?

turbinia/workers/analysis/postgresql_test.py

+                # Evidence mount point location i.e. Evidence.local_path
+                # Use export EVIDENCE_LOCAL_PATH='/mnt/mock' where test image is mounted
+                # to /mnt/mock
+                EVIDENCE_LOCAL_PATH = os.environ.get('EVIDENCE_LOCAL_PATH')

Member

aarontp Jul 20, 2022

What uses this?

turbinia/workers/analysis/postgresql_test.py

+                  task = postgresql.PostgreSQLAnalysisTask()
+                  # pylint: disable=protected-access
+                  pg_config = task._read_postgresql_config(self.POSTGRESQL_CONF)

Member

aarontp Jul 20, 2022

For some reason I can't seem to find this method? Is this supposed to be read_file()?

Contributor Author

roshanmaskey Jul 21, 2022

Old function that is no longer in use. Removed from the test file.

turbinia/workers/analysis/postgresql_test.py



		class PostgreSQLAnalysisTaskTest(TestTurbiniaTaskBase):
		"""Tests for PostgreSQLAnalysisTask."""

Member

aarontp Jul 20, 2022

Can you also add a quick test for the run() method and make sure that the result output is what you're expecting?

jleaniz assigned roshanmaskey

jleaniz added the new-task label

Member

aarontp commented Sep 7, 2023

Hi @roshanmaskey , Just wanted to check in on this one to see if you had any updates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels