Research Question




“Can workforce composition, promotion and resignation patterns, and organisational policies predict the level of female representation in management positions?”

Data Description – Overview

Source: Workplace Gender Equality Agency (WGEA) Public Dataset 2024

  • Legally mandated survey responses from 7,415 Australian employers (≥100 staff)
  • 7 linked datasets covering workforce composition and organisational policies
Workforce Composition Data Policy Questionnaire Data
  • 📊 Workforce Composition | - 🏡 Flexible Work | | |
  • 🧑‍💼 Workforce Management Statistics | - 🛡 Harm Prevention | | | - ❤️ Employee Support | | | | - ⚖️ Action on Gender Equality | | | | - 🧾 Workplace Overview |

Data Description – Target Variable

Female representation in management (as a percentage of total managers):

  • Represented by five equally sized ordinal classes (quintiles)

🟥 Very Low  🟨 Low  🟩 Moderate   🟦High  🟪 Very High

Constructed using data from Workforce Management Statistics:

% female managers = Number of Female Managers / Total Managers

Data Description – Challenges

Integration: Datasets in long format, linked by employer_abn.

31b8e172-b470-440e-83d8-e6b185028602:dAB5AHAAZQA6AE8AQQBCAGwAQQBHAFkAQQBOAFEAQgBoAEEARABjAEEATgB3AEEAeQBBAEMAMABBAFoAQQBCAGsAQQBEAFkAQQBNAHcAQQB0AEEARABRAEEATgBnAEEAeQBBAEQASQBBAEwAUQBBADQAQQBEAFEAQQBZAGcAQgBtAEEAQwAwAEEAWQBRAEIAbQBBAEQARQBBAE8AUQBBADUAQQBEAFUAQQBZAFEAQQB4AEEARwBJAEEATQBnAEIAaQBBAEQAawBBAAoAcABvAHMAaQB0AGkAbwBuADoATgBBAEEAMQBBAEQAVQBBAE0AZwBBAD0ACgBwAHIAZQBmAGkAeAA6AAoAcwBvAHUAcgBjAGUAOgBQAEEAQgAwAEEARwBFAEEAWQBnAEIAcwBBAEcAVQBBAEkAQQBCAHoAQQBIAFEAQQBlAFEAQgBzAEEARwBVAEEAUABRAEEAaQBBAEEAbwBBAEkAQQBBAGcAQQBIAGMAQQBhAFEAQgBrAEEASABRAEEAYQBBAEEANgBBAEQARQBBAE0AQQBBAHcAQQBDAFUAQQBPAHcAQQBLAEEAQwBBAEEASQBBAEIAaQBBAEcAOABBAGMAZwBCAGsAQQBHAFUAQQBjAGcAQQB0AEEARwBNAEEAYgB3AEIAcwBBAEcAdwBBAFkAUQBCAHcAQQBIAE0AQQBaAFEAQQA2AEEARwBNAEEAYgB3AEIAcwBBAEcAdwBBAFkAUQBCAHcAQQBIAE0AQQBaAFEAQQA3AEEAQQBvAEEASQBBAEEAZwBBAEcAWQBBAGIAdwBCAHUAQQBIAFEAQQBMAFEAQgB6AEEARwBrAEEAZQBnAEIAbABBAEQAbwBBAE0AUQBBAHUAQQBEAEEAQQBOAFEAQgBsAEEARwAwAEEATwB3AEEASwBBAEMAQQBBAEkAQQBCAHQAQQBHAEUAQQBjAGcAQgBuAEEARwBrAEEAYgBnAEEAdABBAEgAUQBBAGIAdwBCAHcAQQBEAG8AQQBNAFEAQQB5AEEASABBAEEAZQBBAEEANwBBAEEAbwBBAEkAQQBBAGcAQQBIAFEAQQBaAFEAQgA0AEEASABRAEEATABRAEIAaABBAEcAdwBBAGEAUQBCAG4AQQBHADQAQQBPAGcAQgBzAEEARwBVAEEAWgBnAEIAMABBAEQAcwBBAEMAZwBBAGcAQQBDAEEAQQBZAGcAQgB2AEEASABJAEEAWgBBAEIAbABBAEgASQBBAEwAUQBCAHkAQQBHAEUAQQBaAEEAQgBwAEEASABVAEEAYwB3AEEANgBBAEQARQBBAE0AQQBCAHcAQQBIAGcAQQBPAHcAQQBLAEEAQwBBAEEASQBBAEIAdgBBAEgAWQBBAFoAUQBCAHkAQQBHAFkAQQBiAEEAQgB2AEEASABjAEEATwBnAEIAbwBBAEcAawBBAFoAQQBCAGsAQQBHAFUAQQBiAGcAQQA3AEEAQQBvAEEASQBBAEEAZwBBAEcASQBBAGIAdwBCADQAQQBDADAAQQBjAHcAQgBvAEEARwBFAEEAWgBBAEIAdgBBAEgAYwBBAE8AZwBBAHcAQQBDAEEAQQBNAGcAQgB3AEEASABnAEEASQBBAEEAMgBBAEgAQQBBAGUAQQBBAGcAQQBIAEkAQQBaAHcAQgBpAEEARwBFAEEASwBBAEEAdwBBAEMAdwBBAE0AQQBBAHMAQQBEAEEAQQBMAEEAQQB3AEEAQwA0AEEATQBBAEEANABBAEMAawBBAE8AdwBBAEsAQQBDAEkAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAOABBAEgAUQBBAGEAQQBCAGwAQQBHAEUAQQBaAEEAQQArAEEAQQBvAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBADgAQQBIAFEAQQBjAGcAQQBnAEEASABNAEEAZABBAEIANQBBAEcAdwBBAFoAUQBBADkAQQBDAEkAQQBDAGcAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBCAGkAQQBHAEUAQQBZAHcAQgByAEEARwBjAEEAYwBnAEIAdgBBAEgAVQBBAGIAZwBCAGsAQQBDADAAQQBZAHcAQgB2AEEARwB3AEEAYgB3AEIAeQBBAEQAbwBBAEkAdwBCAG0AQQBEAFUAQQBaAGcAQQAxAEEARwBZAEEATgBRAEEANwBBAEEAbwBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEAWQB3AEIAdgBBAEcAdwBBAGIAdwBCAHkAQQBEAG8AQQBJAHcAQQB6AEEARABNAEEATQB3AEEANwBBAEEAbwBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEAWgBnAEIAdgBBAEcANABBAGQAQQBBAHQAQQBIAGMAQQBaAFEAQgBwAEEARwBjAEEAYQBBAEIAMABBAEQAbwBBAE4AdwBBAHcAQQBEAEEAQQBPAHcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBHAFkAQQBiAHcAQgB1AEEASABRAEEATABRAEIAegBBAEcAawBBAGUAZwBCAGwAQQBEAG8AQQBNAFEAQQB1AEEARABBAEEATgBRAEIAbABBAEcAMABBAE8AdwBBAEsAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEgAUQBBAFoAUQBCADQAQQBIAFEAQQBMAFEAQgAwAEEASABJAEEAWQBRAEIAdQBBAEgATQBBAFoAZwBCAHYAQQBIAEkAQQBiAFEAQQA2AEEASABVAEEAYwBBAEIAdwBBAEcAVQBBAGMAZwBCAGoAQQBHAEUAQQBjAHcAQgBsAEEARABzAEEAQwBnAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQgBzAEEARwBVAEEAZABBAEIAMABBAEcAVQBBAGMAZwBBAHQAQQBIAE0AQQBjAEEAQgBoAEEARwBNAEEAYQBRAEIAdQBBAEcAYwBBAE8AZwBBAHcAQQBDADQAQQBOAFEAQgB3AEEASABnAEEATwB3AEEASwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAGcAQQArAEEAQQBvAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBQAEEAQgAwAEEARwBnAEEASQBBAEIAegBBAEgAUQBBAGUAUQBCAHMAQQBHAFUAQQBQAFEAQQBpAEEASABBAEEAWQBRAEIAawBBAEcAUQBBAGEAUQBCAHUAQQBHAGMAQQBPAGcAQQB4AEEARABRAEEAYwBBAEIANABBAEQAcwBBAEkAQQBCAGkAQQBHADgAQQBjAGcAQgBrAEEARwBVAEEAYwBnAEEAdABBAEcASQBBAGIAdwBCADAAQQBIAFEAQQBiAHcAQgB0AEEARABvAEEATQBnAEIAdwBBAEgAZwBBAEkAQQBCAHoAQQBHADgAQQBiAEEAQgBwAEEARwBRAEEASQBBAEEAagBBAEcAVQBBAE0AQQBCAGwAQQBEAEEAQQBaAFEAQQB3AEEARABzAEEASQBnAEEAKwBBAEMAQQBBAFYAdwBCAHYAQQBIAEkAQQBhAHcAQgBtAEEARwA4AEEAYwBnAEIAagBBAEcAVQBBAEkAQQBCAEQAQQBHADgAQQBiAFEAQgB3AEEARwA4AEEAYwB3AEIAcABBAEgAUQBBAGEAUQBCAHYAQQBHADQAQQBQAEEAQQB2AEEASABRAEEAYQBBAEEAKwBBAEEAbwBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQA4AEEAQwA4AEEAZABBAEIAeQBBAEQANABBAEMAZwBBAGcAQQBDAEEAQQBQAEEAQQB2AEEASABRAEEAYQBBAEIAbABBAEcARQBBAFoAQQBBACsAQQBBAG8AQQBJAEEAQQBnAEEARAB3AEEAZABBAEIAaQBBAEcAOABBAFoAQQBCADUAQQBEADQAQQBDAGcAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEQAdwBBAGQAQQBCAHkAQQBEADQAQQBDAGcAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBADgAQQBIAFEAQQBaAEEAQQBnAEEASABNAEEAZABBAEIANQBBAEcAdwBBAFoAUQBBADkAQQBDAEkAQQBkAGcAQgBsAEEASABJAEEAZABBAEIAcABBAEcATQBBAFkAUQBCAHMAQQBDADAAQQBZAFEAQgBzAEEARwBrAEEAWgB3AEIAdQBBAEQAbwBBAGQAQQBCAHYAQQBIAEEAQQBPAHcAQQBnAEEASABBAEEAWQBRAEIAawBBAEcAUQBBAGEAUQBCAHUAQQBHAGMAQQBPAGcAQQB4AEEARABRAEEAYwBBAEIANABBAEMAQQBBAE0AUQBBADQAQQBIAEEAQQBlAEEAQQA3AEEAQwBBAEEAYgBBAEIAcABBAEcANABBAFoAUQBBAHQAQQBHAGcAQQBaAFEAQgBwAEEARwBjAEEAYQBBAEIAMABBAEQAbwBBAE0AUQBBAHUAQQBEAGMAQQBaAFEAQgB0AEEARABzAEEASQBnAEEAKwBBAEEAbwBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEQAdwBBAGQAQQBCAGgAQQBHAEkAQQBiAEEAQgBsAEEAQwBBAEEAYwB3AEIAMABBAEgAawBBAGIAQQBCAGwAQQBEADAAQQBJAGcAQgAzAEEARwBrAEEAWgBBAEIAMABBAEcAZwBBAE8AZwBBAHgAQQBEAEEAQQBNAEEAQQBsAEEARABzAEEASQBBAEIAaQBBAEcAOABBAGMAZwBCAGsAQQBHAFUAQQBjAGcAQQB0AEEARwBNAEEAYgB3AEIAcwBBAEcAdwBBAFkAUQBCAHcAQQBIAE0AQQBaAFEAQQA2AEEARwBNAEEAYgB3AEIAcwBBAEcAdwBBAFkAUQBCAHcAQQBIAE0AQQBaAFEAQQA3AEEAQwBBAEEAWgBnAEIAdgBBAEcANABBAGQAQQBBAHQAQQBIAE0AQQBhAFEAQgA2AEEARwBVAEEATwBnAEEAdwBBAEMANABBAE8AUQBBADEAQQBHAFUAQQBiAFEAQQA3AEEAQwBJAEEAUABnAEEASwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAFAAQQBCADAAQQBHAGcAQQBaAFEAQgBoAEEARwBRAEEAUABnAEEASwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBEAHcAQQBkAEEAQgB5AEEAQwBBAEEAYwB3AEIAMABBAEgAawBBAGIAQQBCAGwAQQBEADAAQQBJAGcAQgBpAEEARwBFAEEAWQB3AEIAcgBBAEcAYwBBAGMAZwBCAHYAQQBIAFUAQQBiAGcAQgBrAEEAQwAwAEEAWQB3AEIAdgBBAEcAdwBBAGIAdwBCAHkAQQBEAG8AQQBJAHcAQgBtAEEARwBFAEEAWgBnAEIAaABBAEcAWQBBAFkAUQBBADcAQQBDAEEAQQBaAGcAQgB2AEEARwA0AEEAZABBAEEAdABBAEgAYwBBAFoAUQBCAHAAQQBHAGMAQQBhAEEAQgAwAEEARABvAEEATgBnAEEAdwBBAEQAQQBBAE8AdwBBAGkAQQBEADQAQQBDAGcAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAFAAQQBCADAAQQBHAGcAQQBJAEEAQgB6AEEASABRAEEAZQBRAEIAcwBBAEcAVQBBAFAAUQBBAGkAQQBIAEEAQQBZAFEAQgBrAEEARwBRAEEAYQBRAEIAdQBBAEcAYwBBAE8AZwBBADQAQQBIAEEAQQBlAEEAQQA3AEEAQwBJAEEAUABnAEIAbABBAEcAMABBAGMAQQBCAHMAQQBHADgAQQBlAFEAQgBsAEEASABJAEEAWAB3AEIAaABBAEcASQBBAGIAZwBBADgAQQBDADgAQQBkAEEAQgBvAEEARAA0AEEAQwBnAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBQAEEAQgAwAEEARwBnAEEASQBBAEIAegBBAEgAUQBBAGUAUQBCAHMAQQBHAFUAQQBQAFEAQQBpAEEASABBAEEAWQBRAEIAawBBAEcAUQBBAGEAUQBCAHUAQQBHAGMAQQBPAGcAQQA0AEEASABBAEEAZQBBAEEANwBBAEMASQBBAFAAZwBCAHYAQQBHAE0AQQBZAHcAQgAxAEEASABBAEEAWQBRAEIAMABBAEcAawBBAGIAdwBCAHUAQQBEAHcAQQBMAHcAQgAwAEEARwBnAEEAUABnAEEASwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQA4AEEASABRAEEAYQBBAEEAZwBBAEgATQBBAGQAQQBCADUAQQBHAHcAQQBaAFEAQQA5AEEAQwBJAEEAYwBBAEIAaABBAEcAUQBBAFoAQQBCAHAAQQBHADQAQQBaAHcAQQA2AEEARABnAEEAYwBBAEIANABBAEQAcwBBAEkAZwBBACsAQQBHAGMAQQBaAFEAQgB1AEEARwBRAEEAWgBRAEIAeQBBAEQAdwBBAEwAdwBCADAAQQBHAGcAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBADgAQQBIAFEAQQBhAEEAQQBnAEEASABNAEEAZABBAEIANQBBAEcAdwBBAFoAUQBBADkAQQBDAEkAQQBjAEEAQgBoAEEARwBRAEEAWgBBAEIAcABBAEcANABBAFoAdwBBADYAQQBEAGcAQQBjAEEAQgA0AEEARABzAEEASQBnAEEAKwBBAEcAZwBBAFoAUQBCAGgAQQBHAFEAQQBZAHcAQgB2AEEASABVAEEAYgBnAEIAMABBAEQAdwBBAEwAdwBCADAAQQBHAGcAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEQAdwBBAEwAdwBCADAAQQBIAEkAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEAUABBAEEAdgBBAEgAUQBBAGEAQQBCAGwAQQBHAEUAQQBaAEEAQQArAEEAQQBvAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAOABBAEgAUQBBAFkAZwBCAHYAQQBHAFEAQQBlAFEAQQArAEEAQQBvAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAFAAQQBCADAAQQBIAEkAQQBQAGcAQQA4AEEASABRAEEAWgBBAEEAZwBBAEgATQBBAGQAQQBCADUAQQBHAHcAQQBaAFEAQQA5AEEAQwBJAEEAYwBBAEIAaABBAEcAUQBBAFoAQQBCAHAAQQBHADQAQQBaAHcAQQA2AEEARABnAEEAYwBBAEIANABBAEQAcwBBAEkAZwBBACsAQQBEAEUAQQBNAEEAQQB3AEEARABFAEEAUABBAEEAdgBBAEgAUQBBAFoAQQBBACsAQQBEAHcAQQBkAEEAQgBrAEEARAA0AEEAVABRAEIAaABBAEcANABBAFkAUQBCAG4AQQBHAFUAQQBjAGcAQQA4AEEAQwA4AEEAZABBAEIAawBBAEQANABBAFAAQQBCADAAQQBHAFEAQQBQAGcAQgBOAEEARwBFAEEAYgBBAEIAbABBAEQAdwBBAEwAdwBCADAAQQBHAFEAQQBQAGcAQQA4AEEASABRAEEAWgBBAEEAKwBBAEQAVQBBAFAAQQBBAHYAQQBIAFEAQQBaAEEAQQArAEEARAB3AEEATAB3AEIAMABBAEgASQBBAFAAZwBBAEsAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEARAB3AEEAZABBAEIAeQBBAEMAQQBBAGMAdwBCADAAQQBIAGsAQQBiAEEAQgBsAEEARAAwAEEASQBnAEIAaQBBAEcARQBBAFkAdwBCAHIAQQBHAGMAQQBjAGcAQgB2AEEASABVAEEAYgBnAEIAawBBAEQAbwBBAEkAdwBCAG0AQQBHAE0AQQBaAGcAQgBqAEEARwBZAEEAWQB3AEEANwBBAEMASQBBAFAAZwBBADgAQQBIAFEAQQBaAEEAQQBnAEEASABNAEEAZABBAEIANQBBAEcAdwBBAFoAUQBBADkAQQBDAEkAQQBjAEEAQgBoAEEARwBRAEEAWgBBAEIAcABBAEcANABBAFoAdwBBADYAQQBEAGcAQQBjAEEAQgA0AEEARABzAEEASQBnAEEAKwBBAEQARQBBAE0AQQBBAHcAQQBEAEUAQQBQAEEAQQB2AEEASABRAEEAWgBBAEEAKwBBAEQAdwBBAGQAQQBCAGsAQQBEADQAQQBVAEEAQgB5AEEARwA4AEEAWgBnAEIAbABBAEgATQBBAGMAdwBCAHAAQQBHADgAQQBiAGcAQgBoAEEARwB3AEEAUABBAEEAdgBBAEgAUQBBAFoAQQBBACsAQQBEAHcAQQBkAEEAQgBrAEEARAA0AEEAUgBnAEIAbABBAEcAMABBAFkAUQBCAHMAQQBHAFUAQQBQAEEAQQB2AEEASABRAEEAWgBBAEEAKwBBAEQAdwBBAGQAQQBCAGsAQQBEADQAQQBNAHcAQQA4AEEAQwA4AEEAZABBAEIAawBBAEQANABBAFAAQQBBAHYAQQBIAFEAQQBjAGcAQQArAEEAQQBvAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAFAAQQBCADAAQQBIAEkAQQBQAGcAQQA4AEEASABRAEEAWgBBAEEAZwBBAEgATQBBAGQAQQBCADUAQQBHAHcAQQBaAFEAQQA5AEEAQwBJAEEAYwBBAEIAaABBAEcAUQBBAFoAQQBCAHAAQQBHADQAQQBaAHcAQQA2AEEARABnAEEAYwBBAEIANABBAEQAcwBBAEkAZwBBACsAQQBEAEUAQQBNAEEAQQB3AEEARABJAEEAUABBAEEAdgBBAEgAUQBBAFoAQQBBACsAQQBEAHcAQQBkAEEAQgBrAEEARAA0AEEAVgBBAEIAbABBAEcATQBBAGEAQQBCAHUAQQBHAGsAQQBZAHcAQgBwAEEARwBFAEEAYgBnAEEAOABBAEMAOABBAGQAQQBCAGsAQQBEADQAQQBQAEEAQgAwAEEARwBRAEEAUABnAEIATgBBAEcARQBBAGIAQQBCAGwAQQBEAHcAQQBMAHcAQgAwAEEARwBRAEEAUABnAEEAOABBAEgAUQBBAFoAQQBBACsAQQBEAGcAQQBQAEEAQQB2AEEASABRAEEAWgBBAEEAKwBBAEQAdwBBAEwAdwBCADAAQQBIAEkAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEQAdwBBAGQAQQBCAHkAQQBDAEEAQQBjAHcAQgAwAEEASABrAEEAYgBBAEIAbABBAEQAMABBAEkAZwBCAGkAQQBHAEUAQQBZAHcAQgByAEEARwBjAEEAYwBnAEIAdgBBAEgAVQBBAGIAZwBCAGsAQQBEAG8AQQBJAHcAQgBtAEEARwBNAEEAWgBnAEIAagBBAEcAWQBBAFkAdwBBADcAQQBDAEkAQQBQAGcAQQA4AEEASABRAEEAWgBBAEEAZwBBAEgATQBBAGQAQQBCADUAQQBHAHcAQQBaAFEAQQA5AEEAQwBJAEEAYwBBAEIAaABBAEcAUQBBAFoAQQBCAHAAQQBHADQAQQBaAHcAQQA2AEEARABnAEEAYwBBAEIANABBAEQAcwBBAEkAZwBBACsAQQBEAEUAQQBNAEEAQQB3AEEARABJAEEAUABBAEEAdgBBAEgAUQBBAFoAQQBBACsAQQBEAHcAQQBkAEEAQgBrAEEARAA0AEEAVABRAEIAaABBAEcANABBAFkAUQBCAG4AQQBHAFUAQQBjAGcAQQA4AEEAQwA4AEEAZABBAEIAawBBAEQANABBAFAAQQBCADAAQQBHAFEAQQBQAGcAQgBHAEEARwBVAEEAYgBRAEIAaABBAEcAdwBBAFoAUQBBADgAQQBDADgAQQBkAEEAQgBrAEEARAA0AEEAUABBAEIAMABBAEcAUQBBAFAAZwBBADAAQQBEAHcAQQBMAHcAQgAwAEEARwBRAEEAUABnAEEAOABBAEMAOABBAGQAQQBCAHkAQQBEADQAQQBDAGcAQQBnAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQBnAEEARAB3AEEATAB3AEIAMABBAEcASQBBAGIAdwBCAGsAQQBIAGsAQQBQAGcAQQBLAEEAQwBBAEEASQBBAEEAZwBBAEMAQQBBAEkAQQBBAGcAQQBDAEEAQQBJAEEAQQA4AEEAQwA4AEEAZABBAEIAaABBAEcASQBBAGIAQQBCAGwAQQBEADQAQQAKAHMAdQBmAGYAaQB4ADoA:31b8e172-b470-440e-83d8-e6b185028602

</tr>

→ Each employer appears multiple times across occupations and policies, including multi-choice questions.

Data Description – Challenges (cont.)

High Dimensionality: Each policy response becomes a binary feature.

  • With 83 questions and 500+ response options, this leads to 500+ binary-encoded predictors.
Raw Responses Encoded Features
  • Question: What employee support mechanisms are offered? | - Offers_Counselling = 1 |
    • ☑️ Counselling | - Offers_WorkersComp = 1 | |
    • ☑️ Workers comp | - Offers_FlexibleHours = 1 | |
    • ☑️ Flexible hours | |

Missingness: Incomplete questionnaire responses required imputation and cleaning.

Data Cleaning and Preparation

The WGEA data was consolidated into a single employer-level dataset for classification.

Data Integration

  • Merged seven raw datasets using employer_abn as key.
  • Combined workforce statistics and policy indicators into one wide table.

Feature Engineering

  • Workforce Composition Features: % employee count in each subcategory.

  • Organisational Policy Features: Binary policy presence flags.

  • Workforce Movement Features: Gender-specific promotion and resignation rates.

Data Cleaning and Preparation (cont.)

Target Variable Creation

  • Transformed management_female_percent into 5 categorical quantiles, each with ~20% of organisations.

Handling Missing Values

  • Dropped records without a valid employer_abn.
  • “numeric and binary NAs” -> 0 (absence = attribute not present).
  • “categorical NAs” -> “None Reported” (preserve informative missingness).

Result

Final dataset: 6673 employer-level records & 32 features fit for identifying patterns of gender representation in management.

EDA – Target Variable Distribution

  • Underlying distribution of female management share is right-skewed.
  • Most organisations have < 40% female representation in management.
  • Only a minority of firms reach or exceed gender parity.
  • Overall under representation of women in management roles.

EDA – PCA

  • PCA: parametric, global, linear view.
  • PC1 = 35.3%, PC2 = 18.4% together explain about 53.7% of total variance
  • PCA scatterplot: overlap between classes but a slight separation trend along PC1.
  • Scree plot: variance sharply declines after the first 3 components.

EDA – tSNE

  • t-SNE: non-parametric, local, non-linear view.
  • Organisations grouped by similar female management levels.
  • Very Low (red) and Very High (purple) categories form partial clusters.
  • Middle groups overlap, suggesting gradual structural transitions.
  • Non-linear models (eg. XGBoost, Decision Tree) better suited to capture complex patterns.

Modelling Plan

Trained Models

  • Multinomial Logistic Regression:
    • Interpretable linear baseline for multi-class data.
    • Ridge and Lasso regression were also test but were excluded due to performance.
  • Decision Tree:
    • Interpretable model using hierarchical feature splits.
    • Simple non-linear model trained with rpart, tuning cp between 0.001 and 0.05.
  • k-Nearest Neighbours:
    • Non-parametric model capturing non-linear boundaries.
    • Tuned neighbour count (k = 1–20) with feature standardisation (center, scale).
  • Support Vector Machine (RBF Kernel):
    • Kernel method effective for non-linear, high-dimensional data.
    • Tuned cost (C) and kernel width (sigma) via grid search (tuneLength = 15).
  • XGBoost:
    • Ensemble learner handling complex patterns with regularisation.
    • Gradient boosting model tuned via random search (tuneLength = 40)

Modelling Plan (cont.)

Model Training Pipeline

  • All models used same 5-fold stratified cross-validation with an 80/20 train-test split to ensure fair comparison.
  • Implemented with the caret package for consistent preprocessing, tuning, and evaluation.
  • Shared trainControl parameters across all models:
    • method = “cv”, number = 5
    • classProbs = TRUE for probability outputs
    • preProcess = c(“zv”, “center”, “scale”)
    • summaryFunction = multiClassSummary for macro metrics
    • savePredictions = “final” to retain fold-level predictions

Evaluation

  • Primary Metric: Macro F1-Score (equal weight to all five classes).
  • Secondary Metrics: Precision, Recall, AUC, and Ordinal Mean Absolute Error (MAE) for interpretability.

Model Results – Summary

Macro-Averaged Metrics
Model F1 Precision Recall AUC MAE
XGBoost 0.674 0.673 0.675 0.909 0.362
Decision_Tree 0.643 0.641 0.645 0.896 0.406
SVM 0.630 0.631 0.631 0.895 0.415
Multinomial_Logistic 0.623 0.623 0.627 0.895 0.421
KNN 0.507 0.506 0.515 0.843 0.608

Model Results – Confusion Matrix

Top Features – Logistic Regression

Class Top 3 Features Coefficient
Low full_time_female_percent 1.684***
executive_roles_female_percent 1.261***
`anzsic_divisionHealth Care and Social Assistance` -0.735***
Mod full_time_female_percent 2.806***
executive_roles_female_percent 2.112***
`anzsic_divisionHealth Care and Social Assistance` -0.996***
High full_time_female_percent 4.114***
executive_roles_female_percent 2.739***
percent_full_time -1.041***
V.High full_time_female_percent 5.829***
executive_roles_female_percent 3.273***
percent_full_time -1.105***

Note: The V.Low category is omitted as it serves as the base (reference) class in the multinomial logistic regression. All reported coefficients are expressed relative to this baseline.

Top Policy Features – Logistic Regression

Class Top 3 Policy Features Coefficient
Low has_flexible_work_policy 0.120**
offers_paid_secondary_carer_leave 0.117
has_target_for_gender_equity -0.085
Mod has_target_for_women_in_governing_body 0.160**
has_flexible_work_policy 0.138**
offers_paid_secondary_carer_leave 0.125
High gender_equality_training_for_managers 0.125*
has_flexible_work_policy 0.122*
actioned_on_pay_gap_analysis -0.114
V.High has_flexible_work_policy 0.209**
has_policy_for_gender_equality 0.149*
has_pay_equity_strategy -0.117

Note: The V.Low category is omitted as it serves as the base (reference) class in the multinomial logistic regression. All reported coefficients are expressed relative to this baseline.

Conclusion

Summary

  • Moderate predictive accuracy (AUC ≈ 0.9, MAE ≈ 0.4)
  • Full-time female share = dominant driver
  • Policies add incremental but smaller signal

Limitations

  • Cross-sectional → no causality
  • Self-reported data may inflate policy adoption
  • Sparse binary vars, 5-class bucketing hides nuance

Future Work

  • Analyse drivers of full-time female employment
  • Use ordinal-aware or hierarchical models
  • Add temporal data to assess causal impact