Feed SQ | Gordon Institute

Annual Institutional Report — GIAI (2024)

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

GIAI Admin represents the official administrative voice of the Gordon Institute of Artificial Intelligence (GIAI). This account manages institutional communications, announcements, and operational updates across GIAI’s research, education, and global initiatives.

Published

Jan 31, 2025 12:00

Updated

Mar 19, 2026 14:02

Entity: Gordon Institute of Artificial Intelligence (GIAI)
Reporting Period: January–December 2024
Report Type: Aggregate Institutional Review
Disclosure Level: Public Summary

1. Purpose

This report provides an aggregate overview of institutional activities across the GIAI ecosystem during 2024. It summarizes structural developments, cross-entity alignment, and key observations arising from the operation of affiliated systems, including SIAI, The Economy Network, MDSA, and SIAI Labs.

The report does not provide a comprehensive operational account of each entity. Instead, it presents a meta-level assessment of system coherence, functional separation, and institutional evolution.

2. Scope of Review

This report covers:

Structural relationships between affiliated entities
Cross-functional alignment and interaction
Observed developments in research, education, media, and evaluation layers
System-level risks, constraints, and adjustments

The following are excluded:

Detailed operational data from individual entities
Financial or commercial information
Internal deliberations or governance processes

3. Institutional Structure Overview

During the year, the GIAI ecosystem consisted of four primary functional layers:

The Economy Network
(media and publication layer, including The Economy, The Ranking News, and The EduTimes)
SIAI
(execution layer, including SIAI Research, Gordon School of Business, and SIAI Labs)
MDSA
(independent evaluation and oversight layer)
GIAI
(meta-governance and aggregate reporting layer)

This structure remained stable throughout the reporting period, with incremental refinements in role clarity and boundary definition.

4. System-Level Developments

Key developments observed across the system include:

Increased structural alignment between entities, particularly in terms of role definition and output separation
Consolidation of research and education activities under SIAI
Progressive standardization of editorial and academic processes
Emergence of SIAI Labs as a distinct exploratory unit within the broader system
Continued formalization of MDSA’s evaluation framework

No major structural reorganization was undertaken during this period.

5. Cross-Entity Observations

5.1 Functional Separation

Clearer differentiation has emerged between:

research (SIAI Research)
education (GSB)
media (The Economy Network)
evaluation (MDSA)

However, partial overlaps remain, particularly in:

analytical content that bridges research and editorial output
applied work that may originate in SIAI Labs and later inform core activities

5.2 Standardization vs Flexibility

The system reflects an increasing degree of standardization in:

editorial structure
academic processes
evaluation frameworks

At the same time, flexibility is preserved through:

SIAI Labs (exploratory layer)
selective methodological variation within SIAI Research

This balance remains unresolved and requires ongoing calibration.

5.3 Centralization

Across all entities, decision-making remains highly centralized.

While this supports coherence and consistency, it introduces:

scalability constraints
limited distribution of institutional responsibility

5.4 External Visibility vs Internal Structure

There is a divergence between:

internal structural complexity
external perception of the system

Public-facing outputs remain selective and do not fully reflect internal processes.

6. Risk & Constraint Assessment

The following system-level risks were identified:

Boundary Ambiguity:
Incomplete separation between research, editorial, and analytical functions
Over-Centralization:
Dependence on centralized control across multiple entities
Limited External Validation:
Absence of fully independent verification mechanisms beyond MDSA’s current scope
Legacy Dependence:
Continued reliance on earlier outputs (particularly within media) for visibility
Structural Opacity:
Limited public understanding of institutional architecture

7. Actions Taken at System Level

Reinforcement of entity-level role definitions
Introduction of standardized reporting structures across entities
Gradual reduction of structurally inconsistent legacy outputs
Clarification of SIAI’s role as the primary execution layer
Continued separation of experimental activity within SIAI Labs

8. Outstanding Issues

Lack of distributed governance mechanisms
Incomplete formalization of evaluation and review processes
Limited documentation of institutional methodology across entities
Absence of a fully articulated external-facing institutional framework

9. Directional Outlook

The system is expected to focus on the following areas in the next cycle:

Further clarification of functional boundaries across entities
Incremental formalization of internal standards and processes
Selective increase in external visibility of institutional structure
Continued development of evaluation mechanisms within MDSA
Controlled expansion of activities within SIAI and The Economy Network

No major expansion or structural transformation is planned at this stage.

10. Concluding Note

The GIAI ecosystem remains in a transitional phase, moving from a founder-driven structure toward a more formalized institutional system.

Progress during the year has been characterized by:

increased structural clarity
improved consistency
continued reliance on centralized coordination

Further development will depend on the system’s ability to maintain coherence while gradually distributing functions and formalizing its internal logic.

11. Governance Note

This report provides a high-level institutional summary. It does not represent a complete account of all activities conducted within affiliated entities. Certain structural, operational, and methodological details are omitted to preserve confidentiality and governance integrity.

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Annual Academic Operations Report — Gordon School of Business (GSB) (2024)

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

Keith Lee is Professor of AI and Finance at the Gordon School of Business, Swiss Institute of Artificial Intelligence (SIAI). His primary research lies in financial mathematics and AI-driven computational science, with a focus on quantitative modeling of complex economic and financial systems. His work integrates machine learning, stochastic modeling, and data-centric methods to study structural transformations in markets and institutions.

In recent years, his research has extended to the economic and fiscal implications of technological change, including the interaction between artificial intelligence, demographic shifts, and public finance sustainability.

He holds a PhD in Mathematical Finance from Boston University, and previously earned an MSc in Finance and Economics from the London School of Economics. He completed his undergraduate studies in Economics at Seoul National University under the Korea Foundation for Advanced Studies scholarship program.

He regularly contributes analytical essays on the broader socioeconomic implications of AI to The Economy Review.

Published

Jan 31, 2025 12:00

Updated

Mar 19, 2026 13:44

Entity: Gordon School of Business (GSB) @ SIAI
Reporting Period: January–December 2024
Report Type: Academic Operations, Admissions, and Review
Disclosure Level: Public Summary

1. Purpose

This report summarizes academic operations at GSB during 2024, including admissions, student progression, graduation outcomes, and review processes. It aims to provide a structured overview of program execution and academic standards.

2. Scope of Review

This report covers:

Admissions processes and outcomes
Academic progression and program structure
Graduation outcomes
Internal review and compliance-related processes

Excluded:

Individual student records
Internal faculty deliberations
Detailed assessment materials and grading data

3. Program Overview

GSB continued to operate its AI MBA and related programs under a structured model combining:

Business-oriented analytical training
Applied data science components
Project-based evaluation and dissertation work

The program maintained a selective admissions approach with emphasis on professional background and alignment with program objectives.

4. Admissions Summary

Admissions during the year followed a multi-stage evaluation process:

Initial screening (background and eligibility)
Structured evaluation (experience, capability, alignment)
Final review and selection

Observations:

Applicant diversity increased in geographic and professional backgrounds
Variation in technical readiness among applicants required additional calibration
Admissions decisions remained centrally coordinated

5. Academic Progression

Students progressed through a structured program including:

Core modules (business + analytical frameworks)
Applied coursework
Final project or dissertation component

Progression was monitored through milestone-based evaluation rather than continuous grading visibility.

Observations:

Variability in student preparedness affected pacing
Program structure remained stable with minor adjustments
Increased emphasis on applied outputs over theoretical coverage

6. Graduation Outcomes

Graduation was contingent upon:

Completion of required modules
Satisfactory performance in applied work
Submission and acceptance of final project/dissertation

Summary:

Graduation volume remained limited and selective
Completion standards were maintained without relaxation

7. Academic Standards & Review

GSB maintained internal academic oversight processes, including:

Periodic program review
Evaluation of curriculum structure
Alignment with external expectations where applicable

Preparatory alignment with external accreditation frameworks (e.g., Swiss-based review processes) continued, though formal outcomes are not disclosed in detail.

8. Observations

The program remains structurally coherent but dependent on centralized academic control
Variation in student profiles requires ongoing calibration of standards
Limited cohort size supports quality control but restricts scalability
External validation mechanisms remain under development

9. Actions Taken

Refinement of admissions evaluation criteria
Adjustment of program structure to improve alignment with objectives
Strengthening of academic progression checkpoints
Continued preparation for external review processes

10. Outstanding Issues

Lack of fully formalized accreditation status in public-facing terms
Dependence on centralized academic oversight
Limited standardization of evaluation criteria across cohorts
Need for clearer articulation of academic framework to external audiences

11. Next Steps

Continued alignment with accreditation and review frameworks
Further formalization of academic standards
Gradual expansion of program scale with controlled selectivity
Development of clearer public-facing academic documentation

12. Governance Note

This report is a public summary of academic operations. Specific student data, assessment materials, and internal deliberations are not disclosed to preserve confidentiality and academic integrity.

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

Editorial Review Report — The Ranking News (2024)

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Published

Jan 31, 2025 12:00

Updated

Mar 19, 2026 13:42

Entity: The Ranking News
Reporting Period: January–December 2024
Report Type: Editorial Operations & Standards
Disclosure Level: Public Summary

1. Purpose

This report provides a consolidated internal review of editorial operations and standards within The Ranking News during year 2024. It reflects on structural consistency, editorial discipline, and alignment with institutional positioning.

2. Scope of Review

This review covers:

Editorial workflow and publication processes
Structural alignment of content categories
Editorial standards and consistency
Contributor management and content control

Excluded:

Source attribution and investigative methods
Pre-publication editorial deliberations
Internal personnel evaluations

3. Key Developments

Transition toward a unified editorial taxonomy across all sections
Formal separation between narrative journalism and analytical/research outputs
Increased enforcement of tone standardization, particularly in English-language content
Reduction of regionally biased or rhetorically inconsistent legacy content
Establishment of baseline editorial guidelines for contributors

4. Operational Structure

Editorial production during the reporting period operated under a distributed contribution model with centralized control over final publication.

Content was categorized into three primary layers:

News / Narrative Layer (timely, interpretive reporting)
Analysis Layer (structured commentary with economic framing)
Research-adjacent Layer (long-form, quasi-academic outputs)

Editorial decisions were increasingly guided by structural alignment rather than topical opportunism. Publication frequency remained stable, with no deliberate attempt to maximize output volume.

5. Standards Framework

The editorial standards applied during this period were defined by:

Tone Discipline: Neutral, institutional, non-reactive language
Structural Consistency: Standardized article formats across categories
Separation of Functions: Distinction between opinion, reporting, and analysis
Non-disclosure Integrity: Strict protection of sources and internal processes

However, these standards remain partially enforced and are not yet uniformly internalized across all contributors.

6. Observations

Variability in contributor writing style continues to introduce inconsistencies in tone
Structural standardization has improved readability but not fully resolved conceptual drift between articles
English-language content has achieved greater alignment with institutional positioning compared to localized outputs
Editorial judgment remains dependent on centralized oversight rather than distributed editorial maturity

7. Actions Taken

Introduction of standardized editorial templates across content categories
Selective removal or revision of legacy content inconsistent with current positioning
Tightening of contributor guidelines, particularly regarding tone and structure
Reinforcement of separation between editorial and research-oriented outputs

8. Outstanding Issues

Lack of fully internalized editorial culture among contributors
Residual dependence on centralized editorial control
Incomplete differentiation between analysis and opinion in certain outputs
Limited visibility of methodological grounding in analytical content

9. Next Steps

Further codification of editorial standards into formal documentation
Gradual transition toward contributor-level enforcement of tone discipline
Continued restructuring of legacy content
Exploration of clearer public-facing categorization of content types

10. Governance Note

This report reflects an internal editorial assessment summarized for public disclosure. Certain operational details have been generalized or omitted to preserve confidentiality and maintain institutional integrity.

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Annual Activity Report — SIAI Labs (2024)

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Published

Jan 31, 2025 12:00

Updated

Mar 19, 2026 13:53

Entity: SIAI Labs
Reporting Period: January–December 2024
Report Type: Project Activity & Output Summary
Disclosure Level: Public Summary

1. Purpose

This report provides an overview of experimental and non-core initiatives conducted during 2024. These programs operate outside the primary institutional structure and serve as a controlled environment for testing new ideas, methodologies, and applications.

2. Scope of Coverage

This report covers:

Active experimental projects during the reporting period
General areas of exploration
Structural relationship with core institutional entities

Excluded:

Detailed project methodologies
Proprietary data or analytical frameworks
Project-specific financial or partnership information

3. Role within the Institutional System

Experimental Programs function as a non-core exploratory layer, with the following characteristics:

Separation from formal institutional commitments
Limited integration with SIAI, The Economy, or MDSA outputs
Flexibility in scope and methodology
Controlled exposure to public-facing platforms

Outputs from experimental programs are not automatically adopted into core systems.

4. Areas of Activity

During this year, experimental activity included:

Applied data analysis in emerging domains (e.g., entertainment analytics)
Exploratory ranking and benchmarking models
Testing of alternative content formats and distribution approaches
Development of prototype analytical frameworks

Activities remained selective and limited in scale.

5. Operational Approach

Experimental projects were conducted under a lightweight structure:

No formal expansion of dedicated personnel
Centralized oversight of project selection and continuation
Iterative development with no fixed output requirements

Projects were evaluated primarily on conceptual viability rather than immediate application.

6. Observations

Experimental outputs vary significantly in structure and quality
Some projects demonstrate potential for integration into core systems, though validation remains incomplete
Lack of standardized evaluation criteria limits comparability across initiatives
Controlled separation has prevented spillover effects into core institutional activities

7. Actions Taken

Maintenance of strict boundary between experimental and core systems
Selective continuation of projects demonstrating structural relevance
Discontinuation or deprioritization of low-coherence initiatives
Limitation of public exposure for experimental outputs

8. Outstanding Issues

Absence of formal evaluation framework for experimental outputs
Limited documentation of methodologies and findings
Potential redundancy across exploratory projects
Unclear pathway for integration into core institutional systems

9. Next Steps

Development of minimal evaluation criteria for experimental initiatives
Identification of projects suitable for integration into SIAI or The Economy
Continued limitation of scope to maintain structural clarity
Gradual documentation of reusable methodologies

10. Governance Note

Experimental Programs operate outside the formal institutional structure and are not subject to the same standards or review processes as core entities. This report provides a high-level summary only and does not reflect full internal activity.

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Editorial Review Report — The Economy (2024)

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Published

Jan 31, 2025 00:00

Updated

Mar 19, 2026 12:51

Entity: The Economy
Reporting Period: January–December 2024
Report Type: Editorial Operations & Standards
Disclosure Level: Public Summary

1. Purpose

This report provides a consolidated internal review of editorial operations and standards within The Economy during year 2024. It reflects on structural consistency, editorial discipline, and alignment with institutional positioning.

2. Scope of Review

This review covers:

Editorial workflow and publication processes
Structural alignment of content categories
Editorial standards and consistency
Contributor management and content control

Excluded:

Source attribution and investigative methods
Pre-publication editorial deliberations
Internal personnel evaluations

3. Key Developments

Transition toward a unified editorial taxonomy across all sections
Formal separation between narrative journalism and analytical/research outputs
Increased enforcement of tone standardization, particularly in English-language content
Reduction of regionally biased or rhetorically inconsistent legacy content
Establishment of baseline editorial guidelines for contributors

4. Operational Structure

Editorial production during the reporting period operated under a distributed contribution model with centralized control over final publication.

Content was categorized into three primary layers:

News / Narrative Layer (timely, interpretive reporting)
Analysis Layer (structured commentary with economic framing)
Research-adjacent Layer (long-form, quasi-academic outputs)

Editorial decisions were increasingly guided by structural alignment rather than topical opportunism. Publication frequency remained stable, with no deliberate attempt to maximize output volume.

5. Standards Framework

The editorial standards applied during this period were defined by:

Tone Discipline: Neutral, institutional, non-reactive language
Structural Consistency: Standardized article formats across categories
Separation of Functions: Distinction between opinion, reporting, and analysis
Non-disclosure Integrity: Strict protection of sources and internal processes

However, these standards remain partially enforced and are not yet uniformly internalized across all contributors.

6. Observations

Variability in contributor writing style continues to introduce inconsistencies in tone
Structural standardization has improved readability but not fully resolved conceptual drift between articles
English-language content has achieved greater alignment with institutional positioning compared to localized outputs
Editorial judgment remains dependent on centralized oversight rather than distributed editorial maturity

7. Actions Taken

Introduction of standardized editorial templates across content categories
Selective removal or revision of legacy content inconsistent with current positioning
Tightening of contributor guidelines, particularly regarding tone and structure
Reinforcement of separation between editorial and research-oriented outputs

8. Outstanding Issues

Lack of fully internalized editorial culture among contributors
Residual dependence on centralized editorial control
Incomplete differentiation between analysis and opinion in certain outputs
Limited visibility of methodological grounding in analytical content

9. Next Steps

Further codification of editorial standards into formal documentation
Gradual transition toward contributor-level enforcement of tone discipline
Continued restructuring of legacy content
Exploration of clearer public-facing categorization of content types

10. Governance Note

Picture

Member for

1 year 8 months

Real name

GIAI Admin

Bio

Data Scientific Intuition that defines Good vs. Bad scientists

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

Published

Aug 5, 2024 01:24

Modified

Sep 13, 2025 10:54

Many amateur data scientists have little respect to math/stat behind all computational models
Math/stat contains the modelers' logic and intuition to real world data
Good data scientists are ones with excellent intuition

On SIAI's website, we can see most wannabe students go to MSc AI/Data Science program intro page and almost never visit MBA AI program pages. We have a shorter track for MSc that requires extensive pre-study, and much longer version that covers missing pre-studies. Over 90% of wannabes just take a quick scan on the shorter version and walk away. Less than 10% to the longer version, and almost nobody to the AI MBA.

We get that they are 'wannabe' data scientists with passion, motivation, and dream with self-confidence that they are the top 1%. But the reality is harsh. So far, less than 5% applicants have been able to pass the admission exam to MSc AI/Data Science's longer version. Almost never we have applicants who are ready to do the shorter one. Most, in fact, almost all students should compromise their dream and accept the reality. The fact that the admision exam is the first two courses of the AI MBA, lowest tier program, already bring students to senses that over a half of applicants usually disappear before and after the exam. Some students choose to retake the exam in the following year, but mostly end up with the same score. Then, they either criticize the school in very creative ways or walk away with frustrated faces. I am sorry for keeping such high integrity of the school.

Data Scientific Intuition that matters the most

The school focuses on two things in its education. First, we want students to understand the thought processes of data science modelers. Support Vector Machine (SVM), for example, reflects the idea that fitting can be more generalized if a separating hyperplane is bounded with inequalities, instead of fixed conditions. If one can understand that the hyperplane itself is already a generalization, it can be much easier to see through why SVM was introduced as an alternative to linear form fitting and what are the applicable cases in real life data science exercises. The very nature of this process is embedded in the school's motto, 'Rerum Cognoscere Causas' ((Felix, qui potuit rerum cognoscere causas - Wikipedia)), meaning a person pursuing the fundamental causes.

The second focus of the school is to help students where and how to apply data science tools to solve real life puzzles. We call this process as the building data scientific instuition. Often, math equations in the textbooks and code lines in one's program console screens do not have any meaning, unless it is combined in a way to solve a particular problem in a peculiar context with a specific object. Unlike many amateur data scientists' belief, coding libraries have not democratized data science to untrained students. In fact, the codes copied by the amateurs are evident examples of rookie failures that data science tools need must deeper background knowledge in statistics than simple code libraries.

Our admission exam is designed to weed out the dreamers or amateurs. After years of trials and errors, we have decided to give a full lecture of elementary math/stat course to all applicants so that we can not only offer them a fair chance but also give them a warning as realistic as our coursework. Previous schooling from other schools may help them, but the exam help us to see if one has potential to develop 'Rerum Cognoscere Causas' and data scientific intuition.

Intution does not come from hard study alone

When I first raised my voice for the importance of data scientific intution, I had had severe conflicts with amateur engineers. They thought copying one's code lines from a class (or a github page) and applying it to other places will make them as good as high paid data scientists. They thought these are nothing more than programming for websites, apps, and/or any other basic programming exercises. These amateurs never understand why you need to do 2nd-stage-least-square (2SLS) regression to remove measurement error effects for a particular data set in a specific time range, just as an example. They just load data from SQL server, add it to code library, and change input variables, time ranges, and computer resources, hoping that one combination out of many can help them to find what their bosses want (or what they can claim they did something cool). Without understanding the nature of data process, which we call 'data generating process' (DGP), their trials and errors are nothing more than higher correlation hunting like untrained sociologists do in their junk researches.

Instead of blaming one code library worse performing than other ones, true data scientists look for embedded DGP and try to build a model following intuitive logic. Every step of the model requires concreate arguments reflecting how the data was constructed and sometimes require data cleaning by variable re-structuring, carving out endogeneity with 2SLS, and/or countless model revisions.

It has been witnessed by years of education that we can help students to memorize all the necessary steps for each textbook case, but not that many students were able to extend the understanding to ones own research. In fact, the potential is well visible in the admission exam or in the early stage of the coursework. Promising students always ask why and what if. Why SVM's functional shape has $1/C$ which may limit the range of $C$ in his/her model, and what if his/her data sets with zero truncation ends up with close to 0 separating hyperplane? Once the student can see how to match equations with real cases, they can upgrade imaginative thought processes to model building logic. For other students, I am sorry but I cannot recall successful students without that ability. High grades in simple memory tests can convince us that they study hard, but lack of intuition make them no better than a textbook. With the experience, we design all our exams to measure how intuitive students are.

Intuition that frees a data scientist

In my Machine Learning class for tree models, I always emphasize that a variable with multiple disconnected effective ranges in trees has a different spanned space from linear/non-linear regressions. One variable that is important in a tree space, for example, may not display strong tendency in linear vector spaces. A drug that is only effective to certain age/gender groups (say 5~15, 60~ for male, 20~45 female) can be a good example. Linear regression hardly will capture the same efffective range. After the class, most students understand that relying on Variable Importances of tree models may conflict with p-value type variable selections in regression-based models. But only students with intuition find a way to combine both models that they find the effective range of variables from the tree and redesign the regression model with 0/1 signal variables to separate the effective range.

The extend of these types of thought process is hardly visible from ordinary and disqualified students. Ordinary ones may have capacity to discern what is good, but they often have hard time to apply new findings to one's own. Disqualified students do not even see why that was a neat trick to the better exploitation of DGP.

What's surprising is that previous math/stat education mattered the least. It was more about how logical they are, how hard-working they are, and how intuitive they are. Many students come with the first two, but hardly the third. We help them to build the third muscle, while strenghtening the first. (No one but you can help the second.)

The re-trying students ending up with the same grades in the admission exam are largely because they fail to embody the intuition. It may take years to develop the third muscle. Some students are smart enough to see the value of intuition almost right away. Others may never find that. For failing students, as much as we feel sorry for them, we think that their undergraduate education did not help them to build the muscle, and they were unable to build it by themselves.

The less chanllenging tier programs are designed in a way to help the unlucky ones, if they want to make up the missing pieces from their undergraduate coursework. Blue pills only make you live in fake reality. We just hope our red pill to help you find the bitter but rewarding reality.

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

AI Pessimism, just another correction of exorbitant optimism

Picture

Member for

1 year 7 months

Real name

Ethan McGowan

Bio

Professor of AI/Finance, Gordon School of Business, Swiss Institute of Artificial Intelligence

Ethan McGowan is a Professor of AI/Finance and Legal Analytics at the Gordon School of Business, SIAI. Originally from the United Kingdom, he works at the frontier of AI applications in financial regulation and institutional strategy, advising on governance and legal frameworks for next-generation investment vehicles. McGowan plays a key role in SIAI’s expansion into global finance hubs, including oversight of the institute’s initiatives in the Middle East and its emerging hedge fund operations.

Published

Jul 31, 2024 20:48

Modified

Sep 13, 2025 10:54

AI talks turned the table and become more pessimistic
It is just another correction of exorbitant optimism and realisation of AI's current capabilities
AI can only help us to replace jobs in low noise data
Jobs needing to find new patterns and from high noise data industry, mostly paid more, will not be replaceable by current AI

There have been pessimistic talks about the future of AI recently that have created sudden drops in BigTech firms' stock prices. In all of a sudden, all pessimistic talks from Investors, experts, and academics in reputed institutions are re-visited and re-evaluated. They claim that ROI (Return on Investment) for AI is too low, AI products are too over-priced, and economic impact by AI is minimal. In fact, many of us have raised our voices for years with the exactly same warnings. 'AI is not a magic wand'. 'It is just correlation but not causality / intelligence'. 'Don't be overly enthusiastic about what a simple automation algorithm can do'.

As an institution with AI in our name, we often receive emails from a bunch of 'dreamers' that they wonder if we can make a predictive algorithm that can foretell stock price movements with 99.99% accuracy. If we could do that, why do you think we would share the algorithm with you? We should probably keep it for secret and make billions of dollars just for ourselves. As much as the famous expression by Milton Friedman, a Nobel economist, there is no such thing as a free lunch. If we have a perfect predictability and it is widely public, then the prediction is no longer a prediction. If everyone knows the stock A's price goes up, then everyone would buy the stock A, until it reaches to the predicted value. Knowing that, the price will jump to the predicted value, almost instantly. In other words, the future becomes today, and no one gets benefited.

AI = God? AI = A machine for pattern matching

A lot of enthusiasts have exorbitant optimism that AI can overwhelm human cognitivie capacity and soon become god-like feature. Well, the current forms of AI, be it Machine Learning, Deep Learning, and Generative AI, are no more than a machine for pattern matching. You touch a hot pot, you get a burn. It is painful experience, but you learn that you should not touch when it is hot. The worse the pain, the more careful you become. Hopefully it does not make your skin irrecoverable. The exact same pattern works for what they call AI. If you apply the learning processes dynamically, that's where Generative AI comes. The system is constantly adding more patterns into the database.

Though the extensive size of patterns does have great potential, it does not mean that the machine has cognitive capacity to understand the pattern's causality and/or to find new breakthrough patterns from list of patterns in the database. As long as it is nothing more than a pattern matching system, it never will.

To give you an example, can it be used what words you are expected to answer in a class that has been repeated for thousand times? Definitely. Then, can you use the same machine to predict the stock price? Aren't the stock market repeating the same behavior over a century? Well, unfortunately it is not, thus you can't be benefited by the same machine for financial investments.

Two types of data - Low noise vs. High noise

On and near the Wall Street, you can sometimes meet an excessively confident hedge fund manager with claims on near perfect foresight for financial market movements. Some of them have outstanding track records, and surprisingly persuasive. In New York Times archive back in 1940s, or even as early as 1910s, you can see people with similar claims were eventually sued by investors, arrested due to false claims, and/or just disappeared from the street within a few years. If they were that good, why then they lost money and got sued/arrested?

There are two types of data. One set of data that you can see from machine (or highly controlled environment) is called 'Low-noise' data. It has high predictability. Even in cases where embedded patterns are invisible by bare eyes, you either need more analytic brain or a machine to test all possibilities within the possible sets. For the game of Go, the brain was Se-dol Lee and the machine was Alpha-Go. The game needs to test 19x19 possible sets with around 300 possible steps. Even if your brain is not as good as Se-dol Lee, as long as your computer can find the winning patterns, you can win. This is what has been witnessed.

The other set of data comes from largely uncontrolled environment. There potentially is a pattern, but it is not the single impetus that drives every motion of the space. There are thousands, if not millions, of patterns that the driver is not observable. This is where randomness is needed for modeling, and it is unfortunately impossible to predict accurate move, because the driver is not observable. We call this set of data 'High-noise'. The stock market is the very example of such. There are millions of unknown, unexpectable, and at least unmeasurable influences that disable any analyst or machine to predict with accuracy level upto 100%. This is why financial models are not researched for predictability but used only to backtest financial derivatives for reasonable pricing.

Natural language process (NLP) is one example of low noise. Our language follows a certain set of rules (or patterns), which are called grammar. Unless you are uneducated or intentionally out of grammar (or make mistakes), people generally follow grammar. Weather is mostly low noise, but it has high noise components. Sometimes typhoons are unpredictable, or less predictable. Stock market? Be my guest. There have been 4 Nobel Prizes given to financial economists by year 2023, and all of them are based on the belief that stock markets follow random processes, be it Gaussian, Poisson, and/or any other unknown random distributions. (Just in case, if a process follows any known distribution, that means it is probabilistic, which means it is random.)

Potential benefits of AI

We as an institution hardly believe current forms of AI will make any significant changes in businesses and our life in short term. The best we can expect is automation of mundane tasks. Like laundary machine in early 20th century. ChatGPT already has shown us a path. Soon, CS operators will largely be replaced by LLM based chatbots. US companies actively outsourced the function from India for the past a few decades, thanks to cheaper international connectivity via internet. It will still remain, but human actions will be needed way less than before. In fact, we already get machine generated answers from a number of international services. If we complain about a program's malfunction on a WordPress plugin, for instance, established services email us machine answers first. For a few cases, it actually is enough. The practice will become more popular to less-established services as it becomes easier and cheaper to implement.

Teamed up with EduTimes, we also are working on a research to replace 'Copy Boys/Girls'. Journalists that we know from large news magazines are not always running on the street to find new and fascinating stories. In fact, most of them read other newspapers and rewrite the contents as if they were the original sources. Although it is not an important job, it is still needed for the newspaper to run. They need to keep up the current events, accoring to the EduTimes journalists from other renouned newspapers. The copy team is usually paid the least and seen a death sentence as a journalist. What makes the job more sympathetic on top of the least respect, it will soon be replaced by LLM based copywriters.

In fact, any job that generates patterned contents without much of cognitivie functions will gradually be replaced.

What about automotive driving? Is it a low-noise pattern job or a high-noise complicated cognitive job? Well, although Elon Musk claims high possibility of Lv. 4 auto-driving within next a few years, we don't believe so. None of us at GIAI have seen any game theorists have solved multi-agent ($n$>2) Bayesian belief game with imperfect information and unknown agent types by computer so that the automotive driving algorithm can predict what other drivers on the road will do. Without the right prediction of others on the fast moving vehicles, it is hard to tell if your AI will help you successfully avoid other crazy drivers. The driving job for those eventful cases needs 'instinct', which requires another set of bodily function different from cognitive intelligence. The best that the current algorithm can do is to tighten it up to perfection for a single car, which already needs to go over a lot of mathematical, mechanical, organisational, legal, and commercial (and many more) challenges.

Don't they know all that? Aren't the Wall Street investors self-confident, egocentric, but ultra smart that they already know all the limitations of AI? We believe so. At least we hope so. Then, why do they pay attention to the discontentful pessimism now, and create heavy drops in tech stock prices?

Guess the Wall Street hates to see Silicon Valley to be paid too much. American East often think the West too unrealistic and floating in the air. OpenAI's next round funding may surprise us in a totally opposite direction.

Picture

Member for

1 year 7 months

Real name

Ethan McGowan

Bio

MSc AI/Data Science vs. Boot Camp for AI

Picture

Member for

1 year 7 months

Real name

David O'Neill

Bio

Professor of AI/Policy, Gordon School of Business, Swiss Institute of Artificial Intelligence

David O’Neill is a Professor of AI/Policy at the Gordon School of Business, SIAI, based in Switzerland. His work explores the intersection of AI, quantitative finance, and policy-oriented educational design, with particular attention to executive-level and institutional learning frameworks.

In addition to his academic role, he oversees the operational and financial administration of SIAI’s education programs in Europe, contributing to governance, compliance, and the integration of AI methodologies into policy and investment-oriented curricula.

Published

Jul 29, 2024 01:53

Modified

Sep 13, 2025 10:54

Boot camp is for software programming without mathematical training
MSc is a track for PhD, with in-depth scientific research written in the language of math and stat
We respect programmers, but our works are significantly varying

Due to the fact that we are running SIAI, an higher educational institution for AI/Data Science, we often have questions about the difference between Boot Camps for AI and MSc programmes. The shortest answer is the difference in Math requirements. Masters track is for people looking for academic training so that one can read academic papers in that subject. With PhD in the topic, we expect the student to be able to lead a research. From Boot Camp, sorry to be a little aggressive here, but we only expect a 'Coding Monkey'.

We are aware that many countries are shallow in AI/Data Science that they want employees only to be able to best use of Open AI's and AWS's libraries by Rest API. For that, boot camp should be enough, unless the boot camp teacher does not know how to do so. There are nearly infinite amount of contents for how to use Rest API for your software, regardless of your backend platform, be it an easy script languages like Python or tough functional ones like OCaml. Difficulties are not always indicators of determinants in challenges, and we, as data scientists at GIAI, care less about what language you use. What's important is how flexible your thinking for mathematically contained modeling.

Boot camp for software programing, MSc for scientific training

Unfortunately, unless you are lucky enough to be born as smart as Mr. Ramanujan, you cannot learn math modeling skills from a bunch of blogs. Programming, however, has infinitely many proven records of excellent programmers without school traninng. Elon Musk is just one example. He did Economics and Physics in his undergrad at U Penn, and he only stayed one day in the mechanical engineering PhD program at Stanford University. Programming is nothing more than a logic, but math needs too many building blocks to understand the language.

When we first build SIAI, we had quite a lengthy discussion for weeks. Keith was firm that we should stick to mathematical aspects of AI/Data Science. (which doesn't mean we should only teach math, just to avoid any misunderstanding.) Mc wanted two tier tracks for math and coding. We later found that with coding, it is unlikely that we can have the school accreditted by official parties, so we end up with Keith's idea. Besides, we have seen too many Boot Camps around the world that we do not believe we can be competitive in that regard.

The founding motto of the school is 'Rerum Cognoscere Causaus', meaning 'the real cause of things'. With mathematical tools, we were sure that we can teach what are the reason behind a computational model was first introduced. Indeed, Keith has done so well in his Scientific Programming that most students no longer bound to media brainwashing that Neural Network is the most superior model.

Scientists do our own stuff

If you just go through Boot Camps for coding, chances are that you can learn the limitations of Neural Network just by endless trials and errors, if not somebody's Medium posts and Reddit comments. In other words, without the proper math training, it is unlikely one can understand how the computational logics of each model are built, which makes us to aloof from all programmers without necessary math training.

The very idea comes from multiple rounds of uneasy exposures to software engineers without a shred of understanding in modeling side of AI. They usually claim that Neural Network is proven to be the best model, and they do not need any other model knowledge. And all they have to do is to run and test it. Researchers at GIAI are trained scientists, and we mostly can guess what will happen just by looking at equations. And, most importantly, we are well aware that NN is the best model only for certain tasks.

They kept claim that they were like us, and some of them wanted to build a formal assocation with SIAI (and later GIAI). It's hard for us to work with them, if they keep that attitude. These days, whenever we are approached by third parties, if they want to be at equals with us, we ask them to show us math training levels. Please make no mistake that we respect them as software engineers, but we do not respect them as scientists.

Guess aforementioned story and internal discomfort tells you the difference between software engineers and data/research scientists, let alone tools that we rely on.

We screen out students by admission exams in math/stat

With the experience, Keith initiated two admission exams for our MSc AI/Data Science programmes. At the very beginning, we thought there will be plenty of qualifying students, so we used final year undergrad materials. There was a disaster. We gave them two months of dedicated training. Provided similar exams and solved each one of them with extra detail. But, only 2 out of 30 students were able to get grades good enough to be admitted.

We lowered the level down to European 2nd year (perhaps American 3rd year), and the outcome wasn't that different. Students were barely able to grasp superficial concepts of key math/stat. This is why we were kinda forced to create an MBA program that covers European 2nd year teaching materials with ample amount of business application cases. With that, students survive, but answer keys in their final exam tell us that many of them belong to coding Boot Camps, not SIAI.

From year 2025 and onwards, we will have one admission exam for MSc AI/Data Science (2 year) in March, after 2 months pre-training in Jan and Feb. The exam materials will be 2nd year undergrad level. If a student passes, we offer an exam with one notch up in June, again after 2 months pre-training in Apr and May. This will give them MSc AI/Data Science (1 year) admission.

Students who failed the 2-year track admission, we offer them MBA AI program admission, which covers some part of the 2-year track courses. If they think they are ready, then in the following year, they can take the admission exam again. After a year of various courework, some students have shown better performance, based on our statistics, but not by much. It seemed like the brain has its limit that they cannot go above.

Precisely by the same reason, we are reasonably sure that not that many applicants will be able to come to 2-year track, and almost no one for the 1-year track. More details are available from below link:

Program comparison - Swiss Institute of Artificial Intelligence (siai.org)

Picture

Member for

1 year 7 months

Real name

David O'Neill

Bio

Why Companies cannot keep the top-tier data scientists / Research Scientists?

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

Published

Jul 28, 2024 23:07

Modified

Sep 13, 2025 10:54

Top brains in AI/Data Science are driven to challenging jobs like modeling
Seldom a 2nd-tier company, with countless malpractices, can meet the expectations
Even with $$$, still they soon are forced out of AI game

A few years ago, a large Asian conglomerate acquired a Silicon Valley's start-up just off an early Series A funding. Let's say it is start-up $\alpha$. The M&A team leader later told me that the acquisition was mostly to hire the data scientist in the early stage start-up, but the guy left $\alpha$ on the day the M&A deal was announced.

I had an occation to sit down with the data scientist a few months later, and asked him why. He tried to avoide the conversation, but it was clear that the changing circumstances definitely were not within his expectation. Unlike other bunch of junior data scientists in Silicon Valley's large firms, he did signal me his grad school training in math and stat that I had a pleasant half an hour talk about models. He was mal-treated in large firms that he was given to run SQL queries and build Tableau-based graphes, like other juniors. His PhD training was useless in large firms, so he had decided to be a founding member of $\alpha$ that he can build models and test them with live data. The Asian acquirer with bureaucratic HR system wanted him to give up his agenda and to transplant the Silicon Valley large firm's junior data scientist training system to the acquirer firm.

Brains go for brains

Given tons of other available positions, he didn't waste his time. Personall,y I also have lost some months of my life for mere SQL queries and fancy graphes. Well, some people may still go for 'data scientist' title, but I am my own man. So was the data scientist from $\alpha$.

These days, Silicon Valley firms call the modelers as 'research scientists', or simliar names. There also are positions called 'machine learning engineers' whose jobs somewhat related to 'research scientists', but may disinclude mathematical modeling parts and way more software engineering in it. The title 'Data Scientists' are now given to jobs that were used to be called 'SQL monkeys'. As the old nickname suggests, not that many trained scientists would love to do the job, even with competitive salary package.

What companies have to understand is that we, research scientists, are not trained for SQL and Tableau, but mathematical modeling. It's like a hard-trained sushi cook(将太の寿司, shota no sushi) is given to make street food like Chinese noodle.

Let me give you an example in real corporate world. Let's say a semi-conductor company, $\beta$ wants to build a test model for a wafer / subsctrate. What I often hear from those companeis are that they build a CNN model that reads the wafer's image and match it with pre-labeled 0/1 for error detection. In fact, similar practices have been widely adapted practice among all Neural Network maniacs. I am not saying it does not work. It works. But then, what would you do, if the pre-label was done poorly? Say, the 0/1 entries were like over 10,000 and hardly any body double checked the accruracy. Can you rely on that CNN-based model? In addition to that, the model probably require enourmous amount of computational costs to build, let alone test and operating it daily.

Wrong practice that drives out brain

Instead of the costly and less scientific option, we can always build a model that captures data's generated process(DGP). The wafer is composed of $n \times k$ entries, and issues emerge when $n \times 1$ or $1 \times k$ entries go wrong altogether. Given the domain knowledge, one can build a model with cross-products between entries in the same row/column. If it is continuously 1 (assume 1 for error), then it can easily be identified as a defect case.

Cost of building a model like that? It just needs your brain. There is a good chance that you don't even need a dedicated graphics card for that calculation. Maintenance costs are also incomparably smaller than the CNN version. The concept of computational cst is something that you were supposed to learn in any scientific programming classes at school.

For companies sticking to the expensive CNN options, I always can spot followings:

The management has little to no sense of 'computational cost'
The manaement cannnot discern 'research scientists' and 'machine learning engineers'
The company is full of engineers without the sense of mathematical modeling

If you want to grow up as a 'research scientist', just like the guy at $\alpha$, then run. If you are smart enough, you must have already run, like the guy at $\alpha$. After all, this is why many 2nd-tier firms end up with CNN maniacs like $\beta$. Most 2nd-tier firms are unlucky that they cannot keep research scientists due to lack of knowledge and experience. Those companies have to spend years of time and millions of wasted dollars to find that they were so long. By the time that they come to senses, it is mostly already way too late. If you are good enough, don't waste your time on a sinking ship. The management needs so-called cold-turkey type shock treatment as a solution. In fact, there was a start-up that I stayed only for a week, which lost at least one data scientist in everyweek. The company went to bankrupt in 2 years.

What to do and not to do

At SIAI, I place Scientific Programming right after elementary math/stat training. Students see that each calculation method is an invention to overcome earlier available options' limitations but simultanesouly the modification bounds the new tactic in another directions. Neural Networks are just one of the many kinds. Even with the eye-opening experience, some students still remain NN maniacs, and they flunk in Machine Learning and Deep Learning classes. Those students believe that there must exist a grand model that is univerally superior to all other models. I wish the world is that simple, but my ML and DL courses break the very belief. Those who are awaken, usually become excellent data/research scientists. Many of them come back to me that they were able to minimize computational costs by 90% just by replacing blindly implemented Neural Network models.

Once they see that dramatic cost reduction, at least some people understand that the earlier practice was wrong. The smarty student may not be happy to suffer from poor management and NN maniacs for long. Just like the guy at $\alpha$, it is always easier to change your job than fighting to change your incapable management. Managers moving fast maybe able to withhold the smarty. If not, you are just like the $\beta$. You invest a big chunk of money for an M&A just to hire a smarty, but the smarty disappears.

So, if you want to keep the smarty? Your solution is dead simple. Test math/stat training levels in scientific programming. You will save tons of $$$ in graphic card purchase.

Picture

Member for

1 year 8 months

Real name

Keith Lee

Bio

ChatGPT to replace not (intelligent) jobs but (boring) tasks

Picture

Member for

1 year 7 months

Real name

Ethan McGowan

Bio

Published

Jul 17, 2024 00:31

Modified

Sep 13, 2025 10:54

ChatGPT is to replace not jobs but tedious tasks
For newspapers, 'rewrite man' will soon be gone
For other jobs, the 'boring' parts will be replaced by AI, 
but not the intellectual and challenging parts

There has been over a year of hype for Large Language Models(LLMs). At the onset and initial round of hype, people outside of this field asked me if their jobs were to be replaced by robots. By now, over a year of trials with ChatGPT, they finally seem to understand that it is nothing more than an advanced chatbot that still is unable to stop generating 'bullshit', according to Noam Chomsky, an American professor and public intellectual known for his work in linguistics and social criticism.

As my team at GIAI predicted in early 2023, all LLM trials will be able to replace some jobs, but most jobs that will be replaced will be simple mundane tasks. That's because these language models are meant to find higher correlation between text/image groups, but still unable to 'intelligently' find logical connection between thoughts. In statistics, it is called high correlation with no causality, or simply 'spurious relations'.

LLMs will replace 'copying boys/girls'

When we were first approached by EduTimes back in early 2022, they thought we could create an AI machine to replace writers and reporters. We told them the best we can create is to replace a few boring desk jobs like 'rewrite man'. The job that requires to rewrite what other newspapers have already reported. 'Copy boy' is one well-known disparaging term for that job. Most large national magazines have such employees, just to keep their magazines to be up-to-dated with recent news.

Since none of us at GIAI are from journalism, and EduTimes is far from a large national magazine, we are not aware of exact proportion of 'rewrite man' in large magazines, let alone how many articles are re-written by them. But based on what we see from magazines, we can safely argue that at least 60~80% articles are probably written by the 'copy boys/girls'. Some of them are at the high risk of plagiarism. This is one sad reality of journalism industry, accoring to the EduTimes team.

The LLM that we are working on, GLM(GIAI's Language Model), isn't that different from other competitors in the market that we also have to rely on text bodies' correlations, or more precisely 'associations' by the association rules in machine learning textbooks. Likewise, we also have lots of inconsistency problems. To avoid the Noam Chomsky's famous accusation, 'LLMs are bullshit generators', the best any data scientist can do is just to set a high cut-off in support, confidence, and lift. Beyond that, it is not the job of data models, which includes all AI variants for pattern recognition.

But still correlation does not necessarily mean causality

The reason we see infinitely many 'bullshit' cases is because the LLM services still belong to statistics, a discipline to find not causality but correlation.

If high correlation can be translated to high causality, there has to be one important condition satisfied. The data set contains all coherent information so that high correlation naturally means high causality. This actually is where we need the EduTimes. We need clean, high quality, and topic-specific data.

After all, this is why OpenAI is willing to pay for data from Reddit.com, a community with intense and quality discussions. LLM service providers are in negotiation with U.S. top newspapers precisely the same reason. Although it does not mean that coherent and quality news articles will give us 100% guarantee in correlation to causality, at least we can establish a claim that disturbing cases will largely be gone without time-consuming technical optimization.

By the same logic, jobs that can be replaced by LLMs or any other AIs with pattern matching algorithms are the ones that have strong and repeating patterns that does not require logical connections.

AI can replace not (intelligent) jobs but (boring) tasks

As we often joke around at GIAI, technologies are bounded by mathematical limitations. Unfortunately, we are not John von Neumann who can solve every impossible mathematical challenges as easy as college problem sets. Thanks to computational breakthroughs, we are already at the level far from what we expected 10 years ago. Back then, we did not expect to extract corpora from 10 books in a few minites. If anything, we thought it needed weeks of supercomputer resources. It is not anymore. But even with surprising speed of computational achievements, we are still bound to mathematical limits. As said, correlation without causality is 'bullshit'.

With the current mathematical limitations, we can say

AI can replace not (intelligent) jobs but (super mega ultra boring) tasks

And, the replaceable tasks are boring, tedious, repetitive, and patterned tasks. So, please stop worrying about losing jobs, if yours torture your brain to think. Instead, plz think about how to use LLMs like automation to lighten your burden from mundane tasks. It will be like your mom's laundary machine and dish washer. Younger generation females no longer are bound to housekeeping. They go out to work places and fight for the positions that meet their dreams, desires, and wants.

Picture