-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathdata.json
More file actions
5794 lines (5794 loc) · 481 KB
/
data.json
File metadata and controls
5794 lines (5794 loc) · 481 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
{
"projects": [
{
"student_name": "Sophia G.",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/25fb3536164ff3412efca7e41e08e85763b9c40f.png",
"domains": [
"Humanitarian Efforts and Social Justice",
"Published Papers and Science Fairs"
],
"project_title": "A Machine Learning Approach to Understanding the Determining Factors of the Gender Wage Gap",
"project_desc": "Gender inequality is a complex subject consisting of a variety of issues and nuances. In this project, we choose to study gender income inequality—a prevalent issue in current society. Among the many factors that play a role in the gender wage gap, we focus on the affects of marital status, race, geographical location (by state), age, and years of education. By using these variables to create a model able to predict the hourly wage gap between a woman and their equivalent male counterpart, we can analyze the impact of each variable to better understand the role they play in the income gap. Utilizing income data from the Current Population Survey, we train and test five models—a Linear Regression, Decision Tree Regressor, Random Forest Regressor, KNeighbors Regressor, and MLP Regressor. Our Linear Regression model found that there is a correlation between being a never married worker and a smaller gender wage gap, as well as being a married worker with an absent spouse and a greater gender wage gap. In general, though, our models found little correlation between the variables provided and the predicted hourly age gap. \n",
"research_paper": "assets/pdfs/4c2dec4fb3c31bc305c9008eb6addb04d1747d92.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"github": "https://github.com/sophiaguan/genderwagegapanalysis",
"graphic_link": "http://media.wbur.org/wp/2016/04/0416_oag-pay-21-1000x666.jpg",
"headline": " By studying the affect of different attributes on the gender wage gap, we can better understand both the scale of this issue and its possible solutions. So, we explore the question, how does a worker’s marital status, along with other variables, impact the gap in hourly wage between male and female workers? We seek to create a model able to predict the gender wage gap given a set of variables—age, years of education, race, state, and marital status.",
"project_id": "0a574bde2fff76c3dafb371ae2cef1cfbd7ed854",
"expand": true,
"tags": [
"Wage Gap",
"Economics",
"Published",
"Science Fairs"
],
"published": true,
"science_fairs": "2nd Place at San Diego BROADCOM Science Fair (Senior Division)",
"related_proj": [
21,
133,
7
]
},
{
"student_name": "Alice H.",
"mentor_name": "Tony Rodriguez",
"mentor_title": "PhD Candidate in Astrophysics at Caltech",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/e3c71b4cb5bdd5775de029db904c939030e0e618.png",
"domains": [
"Physics",
"Published Papers and Science Fairs"
],
"project_title": "A Hybrid CNN-LSTM Model For Predicting Solar Cycle 25",
"project_desc": "The solar cycle is linked to the number of visible sunspots and follows the fluctuations of the Sun’s magnetic field. It can have powerful global impacts on the Earth. Thus, predicting the timing and amplitude of the peak of the incoming solar cycle 25 is of great importance. This study uses a hybrid deep learning CNN-LSTM model and observed 13-month smoothed sunspot numbers to predict Solar Cycle 25. The MinMax normalization substantially reduces the error of the CNN-LSTM model’s solar cycle predictions compared to the Standard Deviation normalization. The results suggest that it is best to use four historical solar cycles to predict the future solar cycle. The predicted Solar Cycle 25 will have a peak amplitude similar to the 13-month smoothed peak of solar cycle 24. The predicted solar cycle 25 peak spans a relatively long period of time between approximately August 2023 and July 2024. ",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://media.wtsp.com/assets/WTSP/images/20519ffa-0230-435b-8f2b-46718cd853c8/20519ffa-0230-435b-8f2b-46718cd853c8_750x422.jpg",
"headline": "The goal of this study is to predict Solar Cycle 25 through the deep learning approach,and determine what parameters affect prediction accuracy and what the optimal number of historical solar cycles are used to reliably and accurately predict the upcoming solar cycle. The solar cycle predictions will help us prepare ahead of time for future solar activity.",
"project_id": "073776ccdb02a715b1a9c4ea590c72703600cadc",
"expand": false,
"tags": [
"LSTMs",
"Solar Cycles",
"Astronomy",
"Published"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/3996",
"related_proj": [
36,
12,
2
]
},
{
"student_name": "Alex M.",
"mentor_name": "Odysseas Drosis",
"mentor_title": "PhD Candidate in Computer Science, Masters in Computer Science Alum from Cornell",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Business and Finance",
"Published Papers and Science Fairs"
],
"project_title": "Applications of AI in Microfinance ",
"project_desc": "Using AI models to predict who will be able to get a microloan",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://akm-img-a-in.tosshub.com/businesstoday/images/story/202007/investment_660x450_190220100047_200720121651.jpg",
"headline": "I hope to explore how to use AI in the field of microfinance to help reduce income inequality. Microfinance has greatly helped decrease rural poverty rates in Bangladesh. The leader of this effort won the Nobel Peace Prize for his work. These micro-loans give opportunity to those who are not otherwise able to obtain financing for their entrepreneurial ideas. This can be applied in the U.S. too, in places where the population cannot otherwise obtain loans to start small businesses and climb their way out of poverty. It can be very difficult to start from rock bottom in the US, especially for those without access to resources. If people could obtain small loans and start small businesses, they could work their way out of poverty. The microloans have to be viable for banks as well. The problem I’d like to explore is whether AI/ML can be used to determine how to deploy microloans efficiently to address income inequality in the U.S.",
"project_id": "aeb3a7e9bdee8ba99533d1e263521562419a8fb6",
"expand": false,
"tags": [
"Economics",
"Microfinance",
"Inequality",
"Published"
],
"published": true,
"science_fairs": "Massachusetts Science and Engineering Fair (MSEF)",
"related_proj": [
5,
26,
186
]
},
{
"student_name": "Roberto T.",
"mentor_name": "Sophia Barton",
"mentor_title": "Computer Science MS from Stanford",
"mentor_image": "assets/images/mentor_imgs/56c2b2fb8dc613f5a0a7ae222a5ee595734e2950.png",
"student_image": "assets/images/student_imgs/3794c48b2334ab46228d00da2c6c7e612e5bc9e9.png",
"domains": [
"Physics",
"Published Papers and Science Fairs"
],
"project_title": "Stellar Classification based on Numerous Characteristics using Machine Learning",
"project_desc": "The task of stellar classification can be tedious and\nlengthy when done manually. One can expedite stellar classifi-\ncation by creating an artificial intelligence model to automate\nthe process. As we as a species continue to explore the frontier\nof the observable universe, we should seek to automate time\nintensive problems like stellar classification. The current stellar\nclassification model serves to effectively categorize stars for re-\nsearch purposes regarding their distribution around the universe,\nso automating the development of this resource would allow\nprofessionals to allocate more time to explore the bounds of our\ncurrent understanding of space and the universe. After finding\nand analyzing a dataset containing numerical and categorical\nfeatures, a supervised learning approach was then used to train\nand test different models on their ability to classify the stars\nin the given test set. A Decision Tree Classifier, Random Forest\nClassifier, Ridge Classifier, and Support Vector Classifier were\ntrained and tested using the data. The most successful models\nwere the Decision Tree Classifier and Random Forest Classifier,\neach with about a 94 percent prediction accuracy across different\naccuracy metrics on the test data. Despite some drawbacks in\nregards to the availability of usable data, four models were\ntrained and two were proven to be consistently and successfully\naccurate. Any future attempts at developing models for stellar\nclassification should concentrate more on gathering data as to\nhave a more thoroughly trained set of models.",
"research_paper": "assets/pdfs/7b342766e5c2f5ed3fcf18d4910c0e3513323044.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"graphic_link": "https://www.universetoday.com/wp-content/uploads/2008/11/tarantula.jpg",
"headline": "The task of stellar classification can be tedious and lengthy when done manually. One can expedite stellar classification by creating an artificial intelligence model to automate the process. The current stellar classification model serves to effectively categorize stars for research purposes regarding their distribution around the universe, so automating the development of this resource would allow professionals to allocate more time to explore the bounds of our current understanding of space and the universe. After finding and analyzing a dataset containing numerical and categorical features, a supervised learning approach was then used to train and test different models on their ability to classify the stars in the given test set. A Decision Tree Classifier, Random Forest Classifier, Ridge Classifier, and Support Vector Classifier were trained and tested using the data. ",
"project_id": "5ee1177c671abf3e11897fc2ebbba3e1355573d4",
"expand": true,
"tags": [
"Astronomy",
"Stellar Classification",
"Published"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/4375",
"related_proj": [
60,
18,
36
]
},
{
"student_name": "Elgin V.",
"mentor_name": "Joseph Vincent",
"mentor_title": "Aerospace Engineering PhD Candidate at Stanford",
"mentor_image": "assets/images/mentor_imgs/808f2a4f0dad138963d86e72fa8a6bca1820e194.png",
"student_image": "assets/images/student_imgs/235e3e4102d4ee2526e5af8047f3ed72e18f083f.png",
"domains": [
"Sports and Motion Sciences",
"Published Papers and Science Fairs"
],
"project_title": "Predicting Running Injuries with Machine Learning Models",
"project_desc": "Is it possible to predict running injuries with only a dataset and machine learning models? This paper explores this question by using classification models, including the Logistic Regression model and the Random Forest Classifier model. In the dataset used, ten features were taken into account when predicting running injuries. With slight modifications, the Weighted Logistic Regression and over and down-sampling Random Forest Classifier models were used to mitigate the imbalance in the dataset. The results suggested that the best model was Weighted Logistic Regression and that the best score metric to take into account was the F beta score.",
"research_paper": "assets/pdfs/be88cd8430de161d40c89667573175ca23a8a90e.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://www.runtastic.com/blog/wp-content/uploads/2021/05/thumbnail_1200x800-2.jpg",
"headline": "Is it possible to predict running injuries with only a dataset and machine learning models? This paper explores this question by using classification models, including the Logistic Regression model and the Random Forest Classifier model.",
"project_id": "8f11c9cea7ec6c6ef1b452880aceade84635e612",
"expand": false,
"tags": [
"Sports",
"Running",
"Logistic Regression"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/4046",
"related_proj": [
13,
15,
0
]
},
{
"student_name": "Rayyan M.",
"mentor_name": "Philip Bell",
"mentor_title": "Computer Science MS from Georgia Tech, Harvard and Oxford alum, AI Engineer at Cognino ",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/eef1734ba79e5067353abc408ba77616a654a8e7.png",
"domains": [
"Environment",
"Published Papers and Science Fairs"
],
"project_title": "Combating Climate Fake News Using NLP",
"project_desc": "As fake news becomes more prevalent across the US, important issues become harder to solve. One such issue is climate change, where climate misinformation has worsened viewer’s abilities to distinguish between fake information and real information. This project’s objective is to tackle climate misinformation using an artificial intelligence model. The model utilizes a BERT model tested on the “climate_fever” dataset to classify whether climate-related claims are true based on pieces of evidence.",
"research_paper": "assets/pdfs/d80b43b85a066c44470644a3696aca2d9ecfac57.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"github": "https://github.com/SpicyTaco17/Climate-Fake-News",
"project_webpage": "https://spicytaco17-climate-fake-news-app-57cfkv.streamlit.app/",
"graphic_link": "https://media-cldnry.s-nbcnews.com/image/upload/newscms/2018_25/2470661/189619-weather-forecasting-mn-1620.jpg",
"headline": "As fake news becomes more prevalent across the US, important issues become harder to solve. One such issue is climate change, where climate misinformation has worsened viewer’s abilities to distinguish between fake information and real information. This project’s objective is to tackle climate misinformation using an artificial intelligence model.",
"project_id": "1dbfe39aa305fa7f40158cb17cd70a167c56d275",
"expand": false,
"tags": [
"News",
"Fake News",
"Climate Change",
"Natural Language Processing"
],
"published": true,
"science_fairs": "Santa Clara ISEF Qualifier",
"related_proj": [
27,
67,
2
]
},
{
"student_name": "Arnav D.",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "The Differentiation of Viral and Bacterial Pneumonia using Deep Learning",
"project_desc": "This project aims to find out whether a Convolutional Neural Network can be used to classify x-ray scans as having either bacterial or viral Pneumonia. First, its use is discussed, as it is meant to be used to assist with pulmonary healthcare in underprivileged areas. Then, its structure is outlined as a CNN set up using Keras and Tensorflow that is trained on a dataset consisting of 5856 x-ray scans of patients one to five years old from Guangzhou Women and Children’s Medical Center. After testing multiple setups, the final model, based on Inception ResNet v2, differentiates between the two types of Pneumonia with a 75% test accuracy, using minimal epochs in order to increase efficiency. The model was further improved by using dropout, the implementation of early stopping, L2 regularization and augmentation of the training data in order to combat overfitting. Lastly, the model’s limitations and future are discussed. This model proves one can predict and identify types of Pneumonia using a CNN with relatively high accuracy and shows the potential of deep learning in the context of medical imaging to assist medical staff.",
"project_yr": "2022",
"project_quarter": "Fall",
"graphic_link": "https://thorax.bmj.com/content/thoraxjnl/57/5/438/F1.large.jpg",
"headline": "This project aims to find out whether a Convolutional Neural Network can be used to classify x-ray scans as having either bacterial or viral Pneumonia. ",
"project_id": "58dec3622ea48a99256e07cb547de3f8418e0c94",
"expand": false,
"tags": [
"Diseases and Health Conditions",
"Pneumonia",
"Classification",
"Published"
],
"published": true,
"science_fairs": "2nd Place at Orange County Science Fair, California Science and Engineering Fair (CSEF)",
"related_proj": [
98,
119,
16
]
},
{
"student_name": "Avyukth H.",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/431caaf556b23766fb3909779a4d98cb8b62de72.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Impact of Class Weights and Feature Importance in Automated Stroke Detection",
"project_desc": "A stroke occurs when a blood vessel that carries oxygen and nutrients to the brain is either blocked by a clot or ruptures. Consequently, part of the brain is unable to obtain the blood (and oxygen) it needs, so brain cells die. This makes it important to be able to assess the probability of a stroke given features that are specific to patients so that they could take preventative measures in the future. Thus, the unpredictability and deadliness of strokes pose the following questions: Can we predict the occurrence of a stroke from few facts about the patient easily accessible by the doctor? What are the most important features for stroke prediction? In this project, we investigate the feasibility of using a supervised machine learning model to predict stroke occurrence. In practice, however, we faced challenges such as low prevalence and the imbalance in the available dataset, with many more negative than positive cases. In this research paper, we do a parametric study of class weighting as a way to tackle imbalance during training. We then infer the most important features that should be taken into consideration for stroke prediction. Assessing feature importance allows for patients to focus on two or three areas that may be contributing to their high probability for getting a stroke. The most significant result was that the most important feature that should be considered when determining the probability of an individual getting a stroke is age. However, there is no distinct second most important feature. Additionally, a non monotonic improvement with a class weight of 22.5 for positive cases in this dataset produces the most optimal results. \n",
"research_paper": "assets/pdfs/070cedde410910544ceefd49565631e69a685b9a.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://www.medrxiv.org/content/medrxiv/early/2021/04/07/2021.04.02.21253083/F1.large.jpg",
"headline": "In this research paper, we do a parametric study of class weighting as a way to tackle imbalance during training. We then infer the most important features that should be taken into consideration for stroke prediction. ",
"project_id": "c5872d603a0e4ff94f5ecab64817175d43b9961c",
"expand": true,
"tags": [
"Diseases and Health Conditions",
"Strokes",
"Neuroscience"
],
"published": true,
"publications": "Curieux Academic Journal",
"related_proj": [
67,
2,
19
]
},
{
"student_name": "Avnith V.",
"mentor_name": "Jacklyn Luu",
"mentor_title": "Biomedical Informatics MS at Stanford",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Evaluating Machine Learning Models on Predicting Change in Enzyme Thermostability",
"project_desc": "Enzymes are efficient catalysts for biological reactions, and can potentially be designed to speed up non-biological reactions, such as reactions in industrial processes. However, physically experimenting with new protein designs is time consuming, and an efficient method to predict protein stability is needed. Our research problem is finding the best machine learning model to predict the change in enzyme thermostability after a single point mutation in the amino acid sequence. We trained several machine learning models and found that the XGBoost model had the best performance with an R2 score of 0.593 (R2 score is a metric where higher is better and a perfect model would have a score of 1). ",
"research_paper": "assets/pdfs/2534c03998d293d48a8b45f9a7341724d2d1e3df.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"github": "https://github.com/avnithv/enzyme-thermostability-project",
"graphic_link": "https://www.innovationnewsnetwork.com/wp-content/uploads/2020/06/cold-adapted-enzymes-696x392.jpg",
"headline": "Our research problem is finding the best machine learning model to predict the change in enzyme thermostability after a single point mutation in the amino acid sequence. ",
"project_id": "c6626d41c51d0fda47297103ef1343bd58a9b435",
"expand": true,
"tags": [
"Biology",
"Chemistry",
"DNA",
"Gradient Boosting"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/4364",
"related_proj": [
2,
21,
73
]
},
{
"student_name": "Anjali S.",
"mentor_name": "Katie O'Nell",
"mentor_title": "PhD Student at Dartmouth, Brain and Cognitive Sciences BS from MIT, Ethics+Social Sciences project and course developer at Inspirit AI",
"mentor_image": "assets/images/mentor_imgs/3df727e01bd6076d80bdefdd1c47288c998e0f7e.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Humanitarian Efforts and Social Justice"
],
"project_title": "Differences in predicted rates of vaginal births after cesarean across racial groups in a ‘race-neutral’ model",
"project_desc": "When physicians and pregnant patients make decisions about whether to pursue a vaginal birth or cesarean, there are many factors at play. While vaginal birth can have health benefits for both parent and child, there are significant safety risks. In order to minimize these risks, physicians use predictive models to determine how likely patients are to have successful vaginal births after cesareans (VBAC). For many years, these predictive models included race as a variable. This decision recently came under fire, and the Maternal Fetal Medicine Unit (MFMU) published a calculator that did not include race as a variable, but still predicted VBAC success with high accuracy. A large body of work in machine learning has highlighted that supposedly de-biased systems often re-code sensitive variables like race in terms of proxy variables. In order to determine if this was the case in this calculator, we replicated their formula, then found base-rate statistics of all the input variables for three different racial groups: Black, White, and Asian. We found that the distribution of VBAC probabilities for our simulated patients from these three groups were, indeed, significantly different from each other. Further, the predicted VBAC rates increased as a function of societal marginalization: Black patients were 47.6% likely to have a successful VBAC, Asian patients had a 48.6% probability, and White patients had a 49.4% probability. While these values are all within a few percentage points of each other, the differences in these simulated distributions shows how there may still be underlying disparities in the maternal healthcare system.",
"research_paper": "assets/pdfs/4a1e29bb400296758e8d804dbefc5ac88a058f7f.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://www.verywellfamily.com/thmb/m5vCHw2YW_zteiIwmYXmJ2QRosE=/1500x0/filters:no_upscale():max_bytes(150000):strip_icc()/AA6030-001-56a7713e3df78cf77295fbe7.jpg",
"headline": "A large body of work in machine learning has highlighted that supposedly de-biased systems often re-code sensitive variables like race in terms of proxy variables. In order to determine if this was the case in this calculator, we replicated their formula, then found base-rate statistics of all the input variables for three different racial groups: Black, White, and Asian.",
"project_id": "72bcaf1817b1e88e623d2b3f28f29d58dd65d048",
"expand": true,
"tags": [
"Diseases and Health Conditions",
"Women's Health",
"Racial Equality",
"Gender Equality"
],
"published": true,
"publications": "medRxiv Medical Ethics preprint",
"publication_link": "https://www.medrxiv.org/content/10.1101/2023.10.14.23296978v1",
"related_proj": [
167,
104,
184
]
},
{
"student_name": "Shanzeh H.",
"mentor_name": "Odysseas Drosis",
"mentor_title": "PhD Candidate in Computer Science, Masters in Computer Science Alum from Cornell",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/79f656245a5e9bd9b5892bae9b8c3fb9ad33f352.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "The Utilization of Artificial Intelligence in Enabling the Early Detection of Brain Tumors",
"project_desc": "Diagnosing brain tumors is challenging due to their location and varied presentations\nthat may mimic common disorders. A cancer diagnosis can be missed even when advanced imaging is conducted due to interpretive error or an incompatible clinical history as presented. Machine learning, when applied to radiological imagery, can alert physicians earlier to the presence of tumors and improve diagnostic evaluation. This enhanced evaluation can lead to earlier detection of malignant tumors and positively improve prognosis, quality of life, and treatment. This research aims to investigate the application of machine learning to enhance diagnosis. The study developed two machine learning models, a logistic regression model, and a neural network model. We hypothesized that our methods would work sufficiently, proving the correct diagnosis rate, especially within the neural networks model, as they are more complicated in nature. Applying a dataset sourced from Kaggle into the respective algorithms showcased a promising future of machine learning applications to brain tumor diagnosis, with test accuracies in the logistic regression model high (68%) and the neural network model at a significant high (84%). The simple fact of achieving a 84% accuracy rate on new data in the neural network model represents a promising future for the early detection of brain tumors.\n",
"research_paper": "assets/pdfs/00fd300026214ae10e0b11d2cd3da2869f219dba.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://scopeblog.stanford.edu/wp-content/uploads/2022/10/Scope-Arami-Nanostars-1024x578.jpg",
"headline": "This research aims to investigate the application of machine learning to enhance diagnosis.",
"project_id": "0a26e050296441b0075ca224251dbbc48c833cb3",
"expand": true,
"tags": [
"Diseases and Health Conditions",
"Strokes",
"Neuroscience",
"Published",
"Logistic Regression"
],
"published": true,
"publications": "Journal of Emerging Investigators",
"related_proj": [
133,
131,
135
]
},
{
"student_name": "Ashray P.",
"mentor_name": "Christopher Mauck",
"mentor_title": "Computer Science Masters from MIT",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Sports and Motion Sciences",
"Published Papers and Science Fairs"
],
"project_title": "Optimizing Prediction Accuracy Using Advanced Ensemble And Voting Classifier Methods",
"project_desc": "Machine learning models have become more advanced over the last few years, allowing for increased prediction accuracy. One relatively undiscovered model-making technique is the Voting Classifier, which is able to combine the predictions of different ML models to arrive at one output prediction. Harnessing this method can allow for increased accuracy and revolutionize the future of Artificial Intelligence (AI) and Machine Learning (ML).\n\nThis project observes how various machine learning models, once tuned, can further be combined to create a complex model that uses NFL data from the past 18 years to predict the outcomes of matchups between any two competing teams. Through an extensive study of various existing models and key parameters that are critical to the sport of American football, a sophisticated advanced model was created that is highly sensitive to a handful of most important parameters in accurately predicting a given matchup at a level that consistently matches or beats Vegas predictions.",
"research_paper": "assets/pdfs/b973c789e1df2e7826d3db14ebfcdcb3bffa19f9.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"project_webpage": "https://avp123-nflstreamlit-nflapp-z925xp.streamlit.app/",
"graphic_link": "https://dims.apnews.com/dims4/default/53279da/2147483647/strip/false/crop/3000x2000+0+0/resize/1486x991!/quality/90/?url=https%3A%2F%2Fstorage.googleapis.com%2Fafs-prod%2Fmedia%2Fc94ccae5a2354fe3a486e891c2b781e4%2F3000.jpeg",
"headline": "This project observes how various machine learning models, once tuned, can further be combined to create a complex model that uses NFL data from the past 18 years to predict the outcomes of matchups between any two competing teams.",
"project_id": "f6c0acfb6db69f9cc850d744d56092222ac474e4",
"expand": true,
"tags": [
"Sports",
"Pop Culture",
"Published",
"Prediction"
],
"published": true,
"science_fairs": "2nd Place at Contra Costa Science and Engineering Fair",
"related_proj": [
22,
16,
8
]
},
{
"student_name": "Armita K.",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Diagnosing Brain Tumors from MRI Images Using Deep Transfer Learning",
"project_desc": "Each year, more than 100,000 people in the United States are diagnosed with a brain tumor. An early and accurate diagnosis is crucial in getting patients the necessary treatment and increasing survival rates. In recent years, machine learning algorithms have become increasingly popular in the medical field due to their ability to recognize complex patterns and reduce human errors. However, accurate diagnosis using deep learning algorithms requires a large amount of training data, which is not always available. Additionally, training a model from scratch can take a long time and requires vast amounts of computational power. As a solution, this study aims to utilize a transfer learning method in which the prior knowledge of a pretrained model is used to aid in a new classification problem. In this study, a dataset of MRI images consisting of four classes (no tumor, pituitary tumor, meningioma, and glioma) were used. The performance of seven pretrained models (ResNet18, ResNet50, VGG16, DenseNet, GoogLeNet, ShuffleNet, and MobileNet) were evaluated in order to see which would achieve the highest classification accuracy. Additionally, this study examined two different methods for the implementation of transfer learning. In the first method, all layers of the pretrained model were frozen and in the second method, all layers of the pretrained model were trained. The best performing models proved to be ResNet18 and ShuffleNet with all layers trained, achieving an accuracy of 97.86%. The results also showed that the unfrozen models outperformed their frozen counterparts. \n",
"research_paper": "assets/pdfs/e1797a211b5825afc3630eb6b81d399b57f31911.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"graphic_link": "https://assets.medpagetoday.net/media/images/99xxx/99832.jpg?width=0.6",
"headline": "This study aims to utilize a transfer learning method in which the prior knowledge of a pretrained model is used to aid in a new classification problem.",
"project_id": "d50b4f55ed899b280df774d50ecd798c54043892",
"expand": false,
"tags": [
"Diseases and Health Conditions",
"Strokes",
"Neuroscience",
"Published"
],
"published": true,
"publications": "Journal of High School Research",
"publication_link": "https://jhss.scholasticahq.com/article/89930",
"related_proj": [
3,
68,
133
]
},
{
"student_name": "Dhruv M.",
"mentor_name": "Odysseas Drosis",
"mentor_title": "PhD Candidate in Computer Science, Masters in Computer Science Alum from Cornell",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Diversified AI Techniques for Augmenting Brain Tumor Diagnosis",
"project_desc": "Brain tumors affect thousands of people worldwide each year and can be extremely fatal if not diagnosed early They are challenging to diagnose due to their complexity and the overlapping features of different tumor types. This research explores the application of AI technology to expedite the diagnosis of brain tumors. The proposed AI-based approaches involve using deep learning algorithms to analyze medical imaging data, specifically MRI scans. The goal was to build a robust and accurate model that could overcome distribution shifts. Some of the models used include classical machine learning models and a convolutional neural network. The results demonstrate that AI-based approaches can significantly improve the accuracy and expedite the process of brain tumor diagnosis. The performances of the models were evaluated by using cross-validation and measuring accuracy, using a completely different dataset of MRI scans, to assess how the models performed when dealing with distribution shifts. The logistic regression model achieved a testing accuracy of 78.56%. The multi-layer perceptron (MLP) model achieved a testing accuracy of 74.89%. The multiplicative weight update method combined two models\n(MLP and logistic regression) with dynamically adjusted weights and achieved a testing accuracy of 83.83%. An approach where multiple aggregating models were used collaboratively achieved a testing accuracy of 86.46%. The best performing model, a convolutional neural network (CNN), yielded a testing accuracy of 98.20%.",
"research_paper": "assets/pdfs/a1958ab872b0c13f190ad9056b3cdc9807b3356e.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"github": "https://github.com/hello11210/AI-for-Brain-Tumor-Diagnosis",
"graphic_link": "https://cdn.images.express.co.uk/img/dynamic/11/590x/1682717_1.jpg",
"headline": "This research explores the application of AI technology to expedite the diagnosis of brain tumors.",
"project_id": "cc63fba2402223c028df68372b1779aa8e947df6",
"expand": false,
"tags": [
"Diseases and Health Conditions",
"Strokes",
"Neuroscience",
"Published"
],
"published": true,
"publications": "Journal of Student Research",
"related_proj": [
101,
133,
185
]
},
{
"student_name": "Surya K.",
"mentor_name": "Sriram Hathwar",
"mentor_title": "Princeton Alum, Bioinformatics Researcher",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/e882056b2810f35a11fd1195b408148cdebea4d8.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Diagnosing Hypertrophic Cardiomyopathy Using Machine Learning Models on CMRs and EKGs of the Heart",
"project_desc": "Hypertrophic cardiomyopathy (HCM) is a common inherited heart disorder manifesting as hypertrophy of the left ventricle of the heart. However, it often goes undiagnosed, which we must seek to avoid since the possibility of sudden cardiac death (SCD) as a result of HCM is not insignificant. In this project, we presented a pair of models, one CNN model and one Long Short Term Memory (LSTM) model, that are capable of classifying cardiac magnetic resonance (CMR) and heart electrocardiogram (EKG) scans, respectively. We hypothesized that machine learning techniques applied to CMR and EKG data can predict HCM with a high degree of accuracy, precision, recall, and F1 Score. Each of these models classifies their respective scans into HCM and non-HCM categories. The CNN model has an accuracy of 94.71%, a precision of 96.97%, a recall of 91.21%, and an F1 score of 94.85%. The LSTM model has an accuracy of 90.51%, a precision of 60.31%, a recall of 60.08%, and an F1 score of 60.19%. These results show that these machine learning models are viable tools that could assist physicians in the diagnosis of HCM patients.",
"research_paper": "assets/pdfs/e305dc45b1659fe5cba7c86b83075c46b1023f9f.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"github": "https://github.com/suryakolluri6/HCM_diagnostic_models",
"graphic_link": "https://www.ucl.ac.uk/news/sites/news/files/heart_stock_cropped_0.jpg",
"headline": "In this project, we presented a pair of models, one CNN model and one Long Short Term Memory (LSTM) model, that are capable of classifying cardiac magnetic resonance (CMR) and heart electrocardiogram (EKG) scans, respectively.",
"project_id": "3d64a57084fa3a9804772cbb3d3518cebdd44bea",
"expand": true,
"tags": [
"Diseases and Health Conditions",
"Heart Disease",
"Published"
],
"published": true,
"publications": "Journal of Emerging Investigators",
"related_proj": [
34,
42,
108
]
},
{
"student_name": "Katarina A.",
"mentor_name": "Ayush Pandit",
"mentor_title": "PhD Candidate at Stanford, prior Stanford Bioengineering Alum",
"mentor_image": "assets/images/mentor_imgs/8ebea060f7acf53ebda820482eaa4ee26d37df4b.png",
"student_image": "assets/images/student_imgs/16e627973984da51e6911c2796361a48a3207583.png",
"domains": [
"Environment",
"Published Papers and Science Fairs"
],
"project_title": "AI-Based Image Classification Used to Accurately Distinguish Recyclable Material Versus Non-Recyclable Material",
"project_desc": "In today’s world, pollution is increasing as plastics and other materials are not recycled properly, resulting in landfills. One cause of this improper disposal of materials is that it can be difficult to tell if a material is able to be recycled. In response, I created a machine learning model that can distinguish recyclable materials from trash through image classification. For my model, I used a dataset called trash-net. I first extracted the contents of the data and resized the dataset in order to have better organization. There are six categories within the dataset: cardboard, glass, metal, paper, plastic, and trash, that the images are organized in. I used resnet34 which is a pre-trained convolutional neural network (CNN) in order to perform the image classification. Afterwards, I trained my model by running the program repeatedly and then tested it by seeing if it accurately predicted if a material was recyclable or not. Lastly, I used matplotlib to visualize the results. The accuracy of the model ended up being about 88%. Generally, if the accuracy of a machine-learning model is higher than 50% then it performs relatively well. With more training and a greater number of images, the program could potentially increase in accuracy. In conclusion, I think my model would help as it could generally classify if a material was recyclable or not. However, an application in which the user directly scans an item would be more useful. Nevertheless, my model performed well and enabled me to learn more about the use of artificial intelligence. \n\n",
"research_paper": "assets/pdfs/0f175d18ad7de4d88554f88bca3a0959c5b2c2fc.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://s3-prod.crainsnewyork.com/s3fs-public/REAL_ESTATE_130729880_AR_-1_0.jpg",
"headline": "One cause of this improper disposal of materials is that it can be difficult to tell if a material is able to be recycled. In response, I created a machine learning model that can distinguish recyclable materials from trash through image classification.",
"project_id": "3dd373e176952ff4830b8d1d532e0973054b9d40",
"expand": true,
"tags": [
"Environmentalism",
"Recycling",
"Sustainability"
],
"published": true,
"science_fairs": "Synopsys Science Fair",
"related_proj": [
11,
21,
120
]
},
{
"student_name": "Jack L.",
"mentor_name": "Peter Washington",
"mentor_title": "Computer Science Professor at UHawaii, CS PhD from Stanford",
"mentor_image": "assets/images/mentor_imgs/19db8abe6f5ba342397cabd447a93cbb5d008688.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Psychology",
"Published Papers and Science Fairs"
],
"project_title": "Investigating Data Augmentation Strategies for Computer Vision Facial Expression Recognition",
"project_desc": "Autism is a neurodevelopmental disorder. A major symptom is difficulty communicating and understanding social cues such as emotions. I aim to help people with autism better recognize emotions by developing improved artificial intelligence (AI) models to recognize facial expressions. Such models can be and have been integrated into digital therapeutics for children with autism. A crucial step to achieving performant models is to apply data augmentation to increase the dataset size and the generalization capacity. I compare and contrast data augmentation strategies on the Facial Expression Recognition (FER) 2013 dataset to determine which method leads to a maximal increase in performance. I then examine the benefit of data augmentation at various training set sizes. Among the strategies I evaluate, I find that shifting the width of the image provides the greatest increase to performance when compared to not applying data augmentation. Furthermore, I find that at several training dataset sizes ranging from 100 to 20,000 images, applying all data augmentation strategies consistently outperforms no data augmentation. These strategies can inform the development of digital therapies for autism which focus on the evocation and subsequent automatic detection of facial expressions.",
"research_paper": "assets/pdfs/16737c9b12a4149b57d291e8d624dca57062a193.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"graphic_link": "https://149695847.v2.pressablecdn.com/wp-content/uploads/2020/04/Learn-Facial-Recognition-scaled.jpg",
"headline": "I aim to help people with autism better recognize emotions by developing improved artificial intelligence (AI) models to recognize facial expressions.",
"project_id": "2157806554f790856dcb6cc95179a733d8161679",
"expand": true,
"tags": [
"Computer Vision",
"Facial Emotion",
"Developmental Disorders"
],
"published": true,
"publications": "Curieux Academic Journal",
"publication_link": "https://www.curieuxacademicjournal.com/_files/ugd/99711c_cae123aa84284189b8142087c8589d21.pdf",
"related_proj": [
15,
40,
103
]
},
{
"student_name": "Varsha N.",
"mentor_name": "Roger Jin",
"mentor_title": "Computer Science MS at MIT, ML researcher at Apple",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology"
],
"project_title": "A Novel Approach to Promote Equity in Skin Disease Diagnosis by AI Models",
"project_desc": "AI-based systems are increasingly used to diagnose skin diseases with datasets available on the internet. However, the training data predominantly represents fair skinned people. The goal is to increase the accuracy of existing models in diagnosing skin disease across various skin tones within 10% of that obtained in diagnosing fairer skin tones, which is about 95.8%. The publicly available HAM10000 dataset with 10000 fair skin images was preprocessed and used to fine-tune a Vision Transformer model trained on the ImageNet-21K dataset. The model obtained about 94.9% accuracy in diagnosing fair skin images and about 19.8% accuracy in diagnosing real non-fair skin images. Next, 50% of the training data was transformed into non-fair skin simulation by CycleGAN based on 320 real non-fair images. After several iterations, the model achieved about 78.6% accuracy in classifying any skin tone and 52.1% accuracy in classifying real non-fair images. Though this 68.4% is outside of the desired range, it still falls within the accuracy of dermatologists (48–77%). However, it is the accuracy in classifying non-fair skin that makes this model significant. The model achieved a 28.6% increase in accuracy in diagnosing non-fair images using simulated data thereby narrowing the disparity. A confusion matrix was plotted to visualize the validity of the predictions. The functionality of this model suggests that similar data augmentation techniques could be applied to other AI models to ensure their fairness to all categories of people and correct any biases in data due to historic under-representation.",
"research_paper": "assets/pdfs/2ee4416a991da2d5b43d584262c76998e321b45f.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://www.vchri.ca/sites/default/files/styles/max_1300x1300/public/field/image/news_20180724_sunsray.jpg?itok=gFON3EMy",
"headline": "The goal is to increase the accuracy of existing models in diagnosing skin disease across various skin tones within 10% of that obtained in diagnosing fairer skin tones, which is about 95.8%.",
"project_id": "9941206fc0938ca237fdb718341d95aff57a5192",
"expand": false,
"tags": [
"Skin Cancer",
"Diseases and Health Conditions",
"Diseases",
"Science Fairs"
],
"published": true,
"science_fairs": "3rd Place in the Alameda County Science Fair",
"related_proj": [
29,
133,
9
]
},
{
"student_name": "Cherry (.",
"mentor_name": "Shreya Parchure",
"mentor_title": "MD-PhD student at UPenn",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Machine Learning Approaches to Detect Brain Tumors from Magnetic Resonance Imaging Scans",
"project_desc": "Artificial intelligence (AI) models have brought about a significant transformation in various industries, including healthcare, in recent years. Among the many areas benefiting from AI, brain tumor detection has seen remarkable advancements. Accurate brain tumor detection plays a crucial role in the timely diagnosis and treatment of neurological disorders. Thanks to AI models, the process of detecting brain tumors has become more precise and efficient. In our study, we utilized a comprehensive dataset of brain magnetic resonance imaging (MRI) scans to compare and assess the performance of different baseline AI models. These models included the K-Nearest Neighbors (KNN) Classifier, Logistic Regression, Decision Tree Classifier, and Multi-Layer Perceptron (MLP). Our analysis revealed that the KNN Classifier yielded the highest accuracy at 88.5%, making it the most suitable AI baseline model for brain tumor detection. These findings underscore the potential of AI models in achieving accurate and efficient brain tumor detection, paving the way for further advancements in this technology.",
"project_yr": "2022",
"project_quarter": "Winter",
"graphic_link": "https://storage.googleapis.com/treatspace-prod-media/pracimg/u-2740/shutterstock_2140317261.jpeg",
"headline": "Our study utilized a comprehensive dataset of brain magnetic resonance imaging (MRI) scans to compare and assess the performance of different baseline AI models.",
"project_id": "b8f192e089d782f55aba98d180b19be10e08b51c",
"expand": true,
"tags": [
"Neuroscience",
"Biotechnology"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/5601",
"related_proj": [
10,
11,
103
]
},
{
"student_name": "Sam L.",
"mentor_name": "Ivan Villa-Renteria",
"mentor_title": "MS Computer Science at Stanford",
"mentor_image": "assets/images/mentor_imgs/c3367c49c4a121b7eae2189efc7662edd8de4720.png",
"student_image": "assets/images/student_imgs/c3e842b38827667b0ff8824ec9e23d2bcc1854ba.png",
"domains": [
"Healthcare and Biology"
],
"project_title": "Using Machine Learning to Detect Alzheimer’s Disease in MRI Scans",
"project_desc": "Alzheimer’s Disease (AD) is a neurological disorder that slowly eats at the brain and affects a patient’s memory, thoughts, and behavior. It often develops in the later stages of life, and is a very heartbreaking disease to watch a loved one go through. As neurologists get closer to finding a cure for Alzheimer’s Disease (AD), it’s still necessary to catch the disease in its early stages to ensure the best quality of life for those who have it. Along with quality of life, it’s important to know if a patient has AD in order to protect them from autoimmune disorders that can worsen their symptoms or pose a threat due to their vulnerable state. We aimed to answer the question about if Magnetic Resonance Imaging (MRI) scans, which are often used in the diagnosing of other neurological disorders, can be used to diagnose AD in patients. While neurologists have already attempted this, we wondered if we could take it a step further by using machine learning to classify the data and separate it into different categories of dementia in order to properly diagnose a patient, as well as determine the severity of their AD. While conducting this research, we found surprising results, shown in how one of our inferior models reported nearly perfect accuracy while a model that was supposed to be superior reported a significantly poorer accuracy. Overall, our models reported back accuracies over 70%. These models also had to be incredibly complicated, with the image data being put into it having to be processed through hundreds of layers if we wanted a satisfactory accuracy. We concluded that while machine learning did report back very high accuracy, it is not a perfect tool for detecting fallacies in MRI scans, it is not meant to be the sole diagnostic for AD. ",
"research_paper": "assets/pdfs/23349d22dd4128ee293754104a5f0dee4ea629b8.pdf",
"project_yr": "2023",
"project_quarter": "Summer",
"github": "https://github.com/SamLizotte/Machine-Learning-to-Detect-Alzheimers-Disease/tree/main",
"graphic_link": "https://news.weill.cornell.edu/sites/default/files/styles/news_story_featured_image/public/story_featured_images/shutterstock_1111126898.jpg?itok=sURNjAQO&c=67d68c7ce0dba03a2ea246ef534ff826",
"headline": "We aimed to answer the question about if Magnetic Resonance Imaging (MRI) scans, which are often used in the diagnosing of other neurological disorders, can be used to diagnose AD in patients. ",
"project_id": "021f95a954214e2cd5744063595ab1f3bc876923",
"expand": false,
"tags": [
"Neuroscience",
"Diseases and Health Conditions"
],
"published": true,
"publication_link": "https://mareg5.zfairs.com/File/Client/efd07aa7-d1f7-43cc-8292-3bf60118f1f7?ext=.pdf",
"related_proj": [
10,
137,
133
]
},
{
"student_name": "Aarav S.",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Physics",
"Published Papers and Science Fairs"
],
"project_title": "Exploring Asteroid Orbits: Insights from Neural Network Modeling and Data-driven Analysis",
"project_desc": "\tClassifying the orbits of asteroids contributes to research on the Solar System’s formation and evolution, making the classification of orbits a fundamental aspect of space exploration. However, accurate orbit determination is often hindered by skewed observational data stemming from uneven and limited sky coverage. In this study, orbital data from the NASA Jet Propulsion Lab was used in the classification of asteroid orbits through a machine-learning approach. Due to imbalances in the dataset, Synthetic Minority Oversampling Technique (SMOTE) was used to compensate for limited observations of certain orbit types. Afterwards, several classification models were implemented using specific orbital features. The most accurate results in this study were produced by a custom Keras neural network, with similar results demonstrated by an MLP neural network and k-nearest neighbors model. The custom neural network was able to effectively distinguish between orbital patterns, as demonstrated by its 99.18% validation accuracy and nearly diagonal confusion matrix. The capability of these models not only contributes to our understanding of asteroid orbits but also suggests potential improvements in orbit determination methodologies.",
"research_paper": "assets/pdfs/4ca6f413ddb888b88f2bb34db01883b14b03534d.pdf",
"project_yr": "2023",
"project_quarter": "Summer",
"graphic_link": "https://cdn.hswstatic.com/gif/gettyimages-1317904367.jpg",
"headline": "In this study, orbital data from the NASA Jet Propulsion Lab was used in the classification of asteroid orbits through a machine-learning approach.",
"project_id": "dbb1119fb778c16237dcc64e0d56987c55dc6f96",
"expand": true,
"tags": [
"Astronomy",
"Physics",
"Science Fairs"
],
"published": true,
"science_fairs": "Accepted for Exhibition at TNJSF",
"related_proj": [
6,
8,
99
]
},
{
"student_name": "Sam L.",
"mentor_name": "Ivan Villa-Renteria",
"mentor_title": "MS Computer Science at Stanford",
"mentor_image": "assets/images/mentor_imgs/c3367c49c4a121b7eae2189efc7662edd8de4720.png",
"student_image": "assets/images/student_imgs/c3e842b38827667b0ff8824ec9e23d2bcc1854ba.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Using Machine Learning to Detect Alzheimer’s Disease in MRI Scans",
"project_desc": "Alzheimer’s Disease (AD) is a neurological disorder that slowly eats at the brain and affects a patient’s memory, thoughts, and behavior. It often develops in the later stages of life, and is a very heartbreaking disease to watch a loved one go through. As neurologists get closer to finding a cure for Alzheimer’s Disease (AD), it’s still necessary to catch the disease in its early stages to ensure the best quality of life for those who have it. Along with quality of life, it’s important to know if a patient has AD in order to protect them from autoimmune disorders that can worsen their symptoms or pose a threat due to their vulnerable state. We aimed to answer the question about if Magnetic Resonance Imaging (MRI) scans, which are often used in the diagnosing of other neurological disorders, can be used to diagnose AD in patients. While neurologists have already attempted this, we wondered if we could take it a step further by using machine learning to classify the data and separate it into different categories of dementia in order to properly diagnose a patient, as well as determine the severity of their AD. While conducting this research, we found surprising results, shown in how one of our inferior models reported nearly perfect accuracy while a model that was supposed to be superior reported a significantly poorer accuracy. Overall, our models reported back accuracies over 70%. These models also had to be incredibly complicated, with the image data being put into it having to be processed through hundreds of layers if we wanted a satisfactory accuracy. We concluded that while machine learning did report back very high accuracy, it is not a perfect tool for detecting fallacies in MRI scans, it is not meant to be the sole diagnostic for AD. ",
"research_paper": "assets/pdfs/23349d22dd4128ee293754104a5f0dee4ea629b8.pdf",
"project_yr": "2023",
"project_quarter": "Summer",
"github": "https://github.com/SamLizotte/Machine-Learning-to-Detect-Alzheimers-Disease/tree/main",
"graphic_link": "https://d2jx2rerrg6sh3.cloudfront.net/images/news/ImageForNews_768815_17049474459941307.jpg",
"headline": "We aimed to answer the question about if Magnetic Resonance Imaging (MRI) scans, which are often used in the diagnosing of other neurological disorders, can be used to diagnose AD in patients.",
"project_id": "021f95a954214e2cd5744063595ab1f3bc876923",
"expand": false,
"tags": [
"Alzheimer's",
"Diseases and Health Conditions",
"Science Fairs"
],
"published": true,
"science_fairs": "2nd place at Massachusetts Region V Science Fair",
"publication_link": "https://mareg5.zfairs.com/File/Client/efd07aa7-d1f7-43cc-8292-3bf60118f1f7?ext=.pdf",
"related_proj": [
68,
4,
178
]
},
{
"student_name": "Emin C.",
"mentor_name": "Odysseas Drosis",
"mentor_title": "PhD Candidate in Computer Science, Masters in Computer Science Alum from Cornell",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/fabc7768e36a22d4f25408e974243604e9913227.png",
"domains": [
"Business and Finance",
"Published Papers and Science Fairs"
],
"project_title": "Artificial Intelligence in the Stock Market: Predicting Prices",
"project_desc": "This research project focuses on stock price prediction through A.I. models as well as machine learning algorithms to maximize profit potential, improve investments, and eliminate risk. Essentially, this project will demonstrate the modern implementation of A.I. in predicting the stock market. Potentially lucrative company stocks and shares have attracted investors as well as general interest in the stock market for decades, leading more people to try to predict the rise or fall of market prices. However, industry volatility and the seemingly unpredictable nature of the stock market have led many buyers to invest impulsively, sell their shares at the wrong time, or purchase stock from the wrong company. To combat these problems, we trained and tested A.I. models on our collected, classified data in order to generate accurate predictions. These models achieved average prediction errors of 0.12% for the stock prices of Amazon, 0.13% for the stock prices of Google, and 0.07% for Microsoft’s stock prices on the testing datasets.\n",
"research_paper": "assets/pdfs/5f3f567abbe5616e25e2d29670bf1181e70f07d7.pdf",
"project_yr": "2022",
"project_quarter": "Fall",
"graphic_link": "https://www.aljazeera.com/wp-content/uploads/2024/06/AFP__20240119__1945806302__v1__HighRes__StockIndexesRiseSP500ClosesAtTwoYearHigh-1718073351.jpg?resize=770%2C513&quality=80",
"headline": "This research project focuses on stock price prediction through A.I. models as well as machine learning algorithms to maximize profit potential, improve investments, and eliminate risk. ",
"project_id": "ee8aff36bebcc433122a1e77eb970374a11c0aa1",
"expand": false,
"tags": [
"Stocks",
"Economics",
"Published"
],
"published": true,
"publications": "The Stanford Journal of Science, Technology, and Society",
"publication_link": "https://ojs.stanford.edu/ojs/index.php/intersect/article/view/3031",
"related_proj": [
2,
8,
14
]
},
{
"student_name": "Seokhyun C.",
"mentor_name": "Victoria Lloyd",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/a05002f29eb54c606f45e6ae8d8d7d6b9885493b.png",
"domains": [
"Environment",
"Published Papers and Science Fairs"
],
"project_title": "Predicting Climate Change Using an Autoregressive Long Short-Term Memory Model",
"project_desc": "Climate change is an issue that has and will affect humanity throughout the time that we are on Earth. Due to this fact, scientists have created numerous methods of modelling the climate, with mathematical models being used most widely. However, due to some limitations that these models hold, researchers have turned to utilize machine learning models to predict the future climate. This is because AI has a reputation for being able to handle complex data and recognize patterns that humans previously could not have. This study aims to create a baseline machine learning model that utilizes an Autoregressive Recurrent Neural network with a Long Short term memory implementation for the purpose of predicting climate. This type of model architecture has been utilized for other applications that relate to time series data, yet it has yet to be attempted to be used in the context of climate predictions. The data that was utilized was retrieved from the ensemble mean version of the ERA5 dataset. The model created from this study was able to predict the general trends of Earth(i.e, the poles are cold, and the equator region is warm) for both when predicting the climate and when it was predicting weather. When predicting the climate, the model was able to have fair accuracy for a long period of time, with the ability to predict seasonal patterns. This feat is one that other researchers were not able to do with the complex reanalysis data that this study has utilized. This work demonstrated that this type of model can be utilized in a climate forecasting approach as a viable alternative to mathematical models and can be utilized to supplement current work that is mostly successful in short term predictions. ",
"research_paper": "assets/pdfs/00c71309483550d6ac87c750f83854524d73733b.pdf",
"project_yr": "2022",
"project_quarter": "Winter",
"graphic_link": "https://ca-times.brightspotcdn.com/dims4/default/06cdef8/2147483647/strip/true/crop/2696x1664+0+0/resize/1200x741!/quality/75/?url=https%3A%2F%2Fcalifornia-times-brightspot.s3.amazonaws.com%2Fa4%2Faf%2F13fe5626461b9e801ac6e5161c51%2Funited-states-arctic-62857.jpg",
"headline": "This study aims to create a baseline machine learning model that utilizes an Autoregressive Recurrent Neural network with a Long Short term memory implementation for the purpose of predicting climate.",
"project_id": "598032afba3b6220dcc69ec7f0c0fd101475e823",
"expand": true,
"tags": [
"Climate Change",
"Sustainability",
"Published"
],
"published": true,
"publications": "Frontiers in Environmental Science",
"publication_link": "https://www.frontiersin.org/articles/10.3389/fenvs.2024.1301343/full",
"related_proj": [
113,
95,
70
]
},
{
"student_name": "Arnav J.",
"mentor_name": "Erick Siavichay",
"mentor_title": "Computer Science Alum from Stanford, Software Engineering and Data Science at Tagg",
"mentor_image": "assets/images/mentor_imgs/788ef03445f644b0e53860927d1ce0d68eeac867.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "Evaluating the Efficacy of the 3D U-Net Architecture For Glioblastoma Multiforme Tumor Segmentation",
"project_desc": "Glioblastoma is the deadliest form of brain cancer which begins as a congregation of cancerous cells within the brain but then progresses into invading and destroying healthy brain tissue [9]. Radiation therapy, the most popular treatment option, is where neuro-oncologists apply intense radiation energy beams directly on the tumor region to kill the cancerous cells. However, for radiation therapy to be effective, the segmented magnetic resonance image (MRI) in which the oncologists base where to apply the radiation must be segmented nearly 100% accurately, or else the energy beams will mistakenly damage healthy brain tissue. Additionally, radiation therapy will be futile if the MRI segmentation is not complete in time for the therapy appointment. Our research problem is evaluating the performance of notable segmentation models for accurate and immediate 3D segmentation of glioblastoma multiforme brain tumors from MRIs while focusing on one particular architecture, the 3D U-Net. Each of the models was given 850 MRIs from the BraTS 2021 dataset, which is an annual competition hosted by the University of Pennsylvania [8]. Our 3D U-Net model achieved a testing accuracy of 0.986) accuracy and took approximately 55 seconds to predict segmentations of the tumor region. The results reveal that the 3D U-Net model is capable of automating glioblastoma tumor segmentation in significantly fewer hours than a human oncologist would take, all while maintaining similar or higher accuracy, where even a minor difference can mean the distinction between life and death.",
"research_paper": "assets/pdfs/97798e5c4041b6014d8653357f66d617d3f55498.pdf",
"project_yr": "2024",
"project_quarter": "Summer",
"graphic_link": "https://www.genengnews.com/wp-content/uploads/2023/09/GettyImages_1190794650_BrainTumor_cropped.jpg",
"headline": "This research evaluates the performance of the 3D U-Net model for automated glioblastoma tumor segmentation from MRIs, achieving 98.6% accuracy and significantly faster processing times than human oncologists, crucial for effective radiation therapy.",
"project_id": "7197e6be056bf44a07ddae9b38e3c556c4d93ec9",
"expand": true,
"tags": [
"Diseases and Health Conditions",
"Neuroscience"
],
"published": true,
"publications": "Journal of Student Research",
"publication_link": "https://www.jsr.org/hs/index.php/path/article/view/6558",
"related_proj": [
164,
125,
32
]
},
{
"student_name": "Suhaan K.",
"mentor_name": "Tony Rodriguez",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/b985b04a876866abc4fd7da68d847f15e821bbe2.png",
"domains": [
"Physics",
"Published Papers and Science Fairs"
],
"project_title": "The Use of Artificial Intelligence in Gravitational Microlensing Detection for Dark Matter Discoveries ",
"project_desc": "Gravitational lensing has been used for over three decades as effectively the only tool to reveal the nature of \"dark\" sources in the Milky Way. \"Dark\" sources include stars, planets, black holes, or even dark matter. We hypothesized that by using 600,000 stars as data obtained from The Zwicky Transient Facility, a telescope at the California Institute of Technology, we could use Artificial Intelligence to increase the speed and accuracy of microlensing detection. The main objective of this project was to distinguish between microlensing and non-microlensing events, the former of which could be dark matter events. Through the rigorous testing of various Artificial Intelligence models, including Linear Regression, KNN, SVC, Decision Tree, and others, we conducted a comparative analysis to discern disparities in accuracy, precision, and other recall among these models Through hyperparameter tuning and the elimination of false positives, the KNN model was confirmed to be the most accurate model, generating a 97% accuracy for the detection of microlensing events within the Milky Way. This scientific inquiry aims to improve future searches for microlensing events, ultimately expediting the current scientific processes employed in microlensing detection and helping astronomers place search constraints on the nature of dark matter through AI.",
"research_paper": "assets/pdfs/b2fb21db6acbc409292115f0d6f07206b1761e79.pdf",
"project_yr": "2025",
"project_quarter": "Spring",
"graphic_link": "https://i.natgeofe.com/n/f9954a0f-f88e-4ebe-b2c0-39d531a95afc/48384_16x9.jpg",
"headline": "This research utilizes Artificial Intelligence to enhance the speed and accuracy of microlensing detection using data from 600,000 stars observed by The Zwicky Transient Facility. Through comparative analysis and hyperparameter tuning, the KNN model was identified as the most accurate, achieving 97% accuracy in detecting microlensing events, which may include dark matter, within the Milky Way.",
"project_id": "ce2b76a0ff00e0d1bd386c2afaebfb5f77484b4d",
"expand": true,
"tags": [
"Engineering",
"Astronomy",
"Physics"
],
"published": true,
"science_fairs": "3rd Place in the Texas Science and Engineering Fair",
"publication_link": "https://txsef.tamu.edu/awards/2024-category-awards/",
"related_proj": [
16,
12,
36
]
},
{
"student_name": "Azaan R.",
"mentor_name": "Joe Xiao",
"mentor_title": "PhD, Data Scientist at Optum, prior lead AI/Data Scientist at Target and General Mills",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/missing_image.png",
"domains": [
"Healthcare and Biology",
"Published Papers and Science Fairs"
],
"project_title": "VisionAssist: Enhancing Accessibility for Individuals with Visual Impairment Through AI",
"project_desc": "Millions of people around the world are partially or totally blind. Such people face significant challenges in their educational pursuits and therefore they often struggle in life. Can Artificial Intelligence and latest developments in technology help visually impaired people understand literary content easier? To address this question, I have developed an Artificial Intelligence based proof of concept system that accurately converts any image with textual/mathematical data into Braille and Audio mediums as close to real-time as possible. The system takes an image as input and extracts text from it through optical character recognition (OCR) using a finetuned and optimized version of opensource PaddlePaddle OCR model. The system then has two different output options, an audio-based format, and a braille-based tactile format. Word Error Rate (WER) and Character Error Rate (CER) were used as evaluation metrics to evaluate performance of the system on different types of data. My system achieves a weighted average WER of approximately 7% with an average processing time of around 1 sec per 500 characters. The evaluation data shows that the system achieved the goal of having an accurate detection of the data in near real-time speed. My prototype outperforms much bigger and commercial models like ChatGPTv4 in terms of latency as well. This project shows that technology and AI can be an additional tool to help visually impaired people better understand educational content which can eventually uplift them in life.",
"project_yr": "2025",
"project_quarter": "Spring",
"graphic_link": "https://lhblind.org/wp-content/uploads/2023/10/AI-AdobeStock_567681994-web.jpg",
"headline": "This project explores how AI can support individuals with visual impairments by developing a system that converts images containing text or math into audio and Braille in near real-time. Using a fine-tuned OCR model, the system achieves high accuracy and low latency, demonstrating that AI can be a powerful tool for improving accessibility to educational content.",
"project_id": "4064cb21920d9ef34392a083b12c6a426307a1e4",
"expand": true,
"tags": [
"Eyes",
"Diseases and Health Conditions",
"Engineering"
],
"published": true,
"science_fairs": "1st Place in the Alameda County Science Fair",
"publication_link": "https://docs.google.com/presentation/d/1A3VusY0XctXC1fBKNjxb-26fjAWup7ajJel2ZtFxVsY/edit?slide=id.g335de835b50_0_965#slide=id.g335de835b50_0_965",
"related_proj": [
97,
135,
67
]
},
{
"student_name": "Anika S.",
"mentor_name": "Ying Hang Seah",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/3e81a81724cf0073d0800f5d96bcfd5615110f17.png",
"domains": [
"Environment",
"Engineering"
],
"project_title": "Landfill Net: A Convolutional Neural Network (CNN) Architecture for the Detection of Landfills from Satellite Imagery in the Continental United States",
"project_desc": "As we continue through the 21st century, the effects of climate change have become more and more apparent, with methane emissions playing a pivotal role as one of the most potent greenhouse gasses. Landfill decomposition is a large contributor to this rise in methane, and certain landfills have even adopted methane capture systems that enable the conversion of this methane into valuable energy resources. To continue to mitigate these emissions, it is imperative to establish a comprehensive United States landfill database. We present a Convolutional Neural Network (CNN) model that uses image data in the United States to distinguish whether an image is a landfill or not. The model had a test accuracy of 97.1%, a precision of 96.3%, a recall of 98.1%, and an F-1 score of 97.2%. The results show that the model holds the potential to uncover illegal landfills within the United States and facilitate an informed expansion of methane collection systems in landfill sites.",
"research_paper": "assets/pdfs/9b7a9bb8993cb8645f1200f22b54f5e7d19beefc.pdf",
"project_yr": "2023",
"project_quarter": "Fall",
"github": "https://github.com/anikaSeshan/LandfillNetResearch.git",
"graphic_link": "https://img.freepik.com/free-vector/car-dump-cartoon-composition-with-crushed-auto-landfill-vector-illustration_1284-81099.jpg?semt=ais_hybrid&w=740",
"headline": "This project presents a Convolutional Neural Network (CNN) model that identifies landfills in the U.S. using image data, achieving a 97.1% test accuracy. By supporting the creation of a national landfill database, the model can help detect illegal sites and guide the expansion of methane capture systems to reduce greenhouse gas emissions.",
"project_id": "cb6efe28ac166c1553671ee2e20ce03f130ee44f",
"expand": true,
"tags": [
"Sustainability",
"Pollution",
"Engineering"
],
"published": true,
"science_fairs": "Tackling Social Issues with AI | Inspirit AI Research Symposium | Category Winner",
"related_proj": [
70,
159,
125
]
},
{
"student_name": "Arpan A.",
"mentor_name": "Mirna Kheir Gouda",
"mentor_title": "Biomedical Engineering PhD Candidate at MIT",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/649fdf32d8c6917b69fb059ba2d77c623865341c.png",
"domains": [
"Environment"
],
"project_title": "Most Important Soil Properties to Consider for High Crop Yield",
"project_desc": "With the current population of Earth massively growing, there is a real risk coming up in the near future of not having enough food to feed the planet. One of the best ways to solve this upcoming problem is to improve the quality of the soil because that will help increase Earth’s overall crop yield in a sustainable way. However, figuring out how to improve soil can be a big challenge since there are so many elements and properties of soil to analyze. To help solve this problem we looked into the elements or properties of soil that could have the biggest influence on soils ability to be productive for growing crops. We found online datasets and used unsupervised machine learning by applying both principal component analysis (PCA) and hierarchal clustering on our data. We concluded that calcium, potassium, pH, and soil organic carbon are the elements or properties of soil that have the biggest influence on soil's ability to be productive for growing crops. If these results are accurate, it can massively help many farmers and people in improving their soil quality. This can lead to us taking a big step in the direction of increasing Earth’s overall crop yield. ",
"research_paper": "assets/pdfs/6db01dae04a368907c9e65aed25f76c0199aa535.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://www.allaboutarizonanews.com/wp-content/uploads/2019/01/CottonField.PhoenixAZ.140320-1-1080x608.jpg",
"headline": "With the current population of Earth massively growing, there is a real risk coming up in the near future of not having enough food to feed the planet. One of the best ways to solve this upcoming problem is to improve the quality of the soil because that will help increase Earth’s overall crop yield in a sustainable way.",
"project_id": "c8ddc52adbc15aa68fc701cde83fd2e65d2e5fcd",
"expand": false,
"tags": [
"Earth Systems",
"Irrigation",
"Water Use"
],
"related_proj": [
63,
123,
33
]
},
{
"student_name": "Leah N.",
"mentor_name": "Odysseas Drosis",
"mentor_title": "PhD Candidate in Computer Science, Masters in Computer Science Alum from Cornell",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/23725615a8586e726fc6fa97c0ccdc962d6825bd.png",
"domains": [
"Healthcare and Biology"
],
"project_title": "Developing an Accurate AI Algorithm for Histopathologic Cancer Detection",
"project_desc": "\tThis paper discusses the development of a machine learning algorithm that accurately detects metastatic breast cancer (the cancer has spread elsewhere from its origin part) in select images that come from pathology scans of lymph node sections. Being able to develop an accurate artificial intelligence (AI) algorithm would help significantly in breast cancer diagnosis since manual examination of lymph node scans is both tedious and oftentimes highly subjective. The usage of AI in the diagnosis process provides a much more straightforward, reliable, and efficient method for medical professionals and would enable faster diagnosis and, therefore, more immediate treatment. The overall approach used was to train a convolution neural network (CNN) based on a set of pathology scan data and using the trained model to binarily classify if a new scan were benign or malignant, outputting a 0 or a 1, respectively. The final model’s prediction accuracy is very high, with 100% for the train set and over 70% for the test set. Being able to have such high accuracy using an AI model is monumental in regards to medical pathology and cancer detection. Having AI as a new tool capable of quick detection will significantly help medical professionals and patients suffering from cancer.",
"research_paper": "assets/pdfs/8a5e8935253f9eab111645e70bd509856ac1b1f8.pdf",
"project_yr": "2022",
"project_quarter": "Summer",
"graphic_link": "https://miro.medium.com/max/800/1*0YHaGNOMIhZaHkZ7RR-FFQ.jpeg",
"headline": "In this specific research project, we will be focusing on the lymph node scans of women with breast cancer, which is the most common cancer for women residing in the US, other than skin cancer. Research statistics show that about 1 in 8 women in the United States will develop invasive breast cancer throughout her life.",
"project_id": "497d2dfb6fdc004c66430347d7e0e2026a04d6c5",
"expand": false,
"tags": [
"Cancer",
"Diseases and Health Conditions"
],
"related_proj": [
163,
156,
7
]
},
{
"student_name": "Jason M.",
"mentor_name": "Anna Orosz",
"mentor_title": "Data Science MS from UPenn, Machine Learning Engineer at LinkedIn",
"mentor_image": "assets/images/missing_image.png",
"student_image": "assets/images/student_imgs/938de8743e5016120fb1d9f657fbd8f531a0858d.png",
"domains": [
"Sports and Motion Sciences",
"Engineering"
],
"project_title": "Allez Go: AI Fencing Referee",
"project_desc": "The gradual increase in online fencing videos over the past decade has allowed for novel technical projects in fencing that rely heavily on data, such as artificial intelligence. This study resulted in a state-of-the-art lightweight Temporal Convolutional Network to referee fencing bouts and classify actions as either a touch for the fencer on the left or the fencer on the right. To address this problem, we developed a pose estimation and audio analysis approach to autonomously referee fencing bouts. Using a custom dataset of international level fencing from the last 7 years, including ~4000 unique clips, our model achieved an accuracy of 89.1%, a 20% increase over previous state-of-the-art models. This model leverages advancements in human pose estimation to extract the position of both fencers and avoids high computational loads typically associated with CNNs. Additionally, it uses a novel technique to solve the issue of blade contact, a key component of refereeing fencing that was generally unaddressed in previous works. Our novel solution uses audio to ‘listen’ for the sound of blade contact rather than attempting to identify it visually. ",
"research_paper": "assets/pdfs/06bec52004cebec38e16889039e8afd90442d3c9.pdf",
"project_yr": "2022",
"project_quarter": "Spring",
"project_webpage": "https://bananapeel202-fencingwebapp-upload-xnj04o.streamlitapp.com/",
"graphic_link": "https://images.theconversation.com/files/133347/original/image-20160808-18050-xlihp1.jpg?ixlib=rb-1.1.0&q=45&auto=format&w=1200&h=900.0&fit=crop",
"headline": "Technology in fencing is generally an underdeveloped field and automated referees present potentially significant benefits to the sport. Automated referees will offer a more consistent call compared to a group of human referees with slightly different interpretations of the fencing rules.",
"project_id": "77ececfe30468cb64dc87fadf0e983552743a873",
"expand": false,
"tags": [
"Sports",
"Fencing",
"App Development",
"Audio Analysis"
],
"related_proj": [
49,
141,
167
]