GENSCAN 1.0 Date run: 3-Nov-116 Time: 23:04:06 Sequence gi568815595f:23854535_24077416 : 222882 bp : 41.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 7037 7076 40 -3.05 1.01 Init + 7085 7359 275 1 2 43 72 193 0.391 9.69 1.02 Intr + 18596 18671 76 2 1 64 115 49 0.143 3.80 1.03 Intr + 24111 24277 167 2 2 26 101 103 0.394 3.24 1.04 Intr + 25078 25164 87 1 0 93 94 48 0.379 4.07 1.05 Intr + 33033 33165 133 0 1 53 116 121 0.747 11.13 1.06 Intr + 39632 39861 230 1 2 8 72 154 0.286 1.54 1.07 Term + 40304 40502 199 2 1 53 41 128 0.681 0.49 1.08 PlyA + 41787 41792 6 1.05 2.04 PlyA - 43282 43277 6 1.05 2.03 Term - 46515 46235 281 1 2 36 37 265 0.767 10.82 2.02 Intr - 48439 48352 88 1 1 47 36 122 0.512 1.52 2.01 Init - 53145 53083 63 0 0 69 81 72 0.610 5.80 2.00 Prom - 59793 59754 40 -7.15 3.00 Prom + 61768 61807 40 -7.45 3.01 Init + 63326 63497 172 1 1 99 102 209 0.995 23.05 3.02 Intr + 63906 64042 137 1 2 91 84 171 0.999 16.37 3.03 Term + 64662 64967 306 2 0 91 34 206 0.998 9.63 3.04 PlyA + 64997 65002 6 1.05 4.03 PlyA - 65343 65338 6 1.05 4.02 Term - 71998 71850 149 2 2 64 43 206 0.978 10.78 4.01 Init - 73020 72954 67 0 1 65 44 26 0.304 -2.81 4.00 Prom - 76023 75984 40 -7.55 5.00 Prom + 78421 78460 40 -6.65 5.01 Init + 82427 82552 126 1 0 63 97 132 0.847 11.81 5.02 Intr + 91671 91948 278 2 2 4 -9 269 0.007 4.19 5.03 Intr + 100064 100269 206 2 2 88 108 95 0.410 9.42 5.04 Intr + 101503 101591 89 2 2 96 98 49 0.973 5.47 5.05 Intr + 105137 105281 145 1 1 82 80 77 0.990 5.23 5.06 Intr + 107443 108071 629 2 2 50 40 468 0.758 29.20 5.07 Intr + 110443 110628 186 0 0 48 92 75 0.650 2.86 5.08 Intr + 113279 113489 211 1 1 113 115 210 0.998 23.76 5.09 Term + 122689 122885 197 2 2 60 48 114 0.804 1.19 5.10 PlyA + 122992 122997 6 1.05 6.04 PlyA - 123896 123891 6 1.05 6.03 Term - 127209 126818 392 1 2 30 37 282 0.872 11.46 6.02 Intr - 127873 127554 320 0 2 85 97 177 0.628 12.98 6.01 Init - 134533 134460 74 1 2 115 99 27 0.910 7.17 6.00 Prom - 140797 140758 40 -6.85 7.07 PlyA - 141219 141214 6 1.05 7.06 Term - 147631 147479 153 1 0 90 38 104 0.549 2.54 7.05 Intr - 162401 162340 62 0 2 89 80 86 0.254 5.43 7.04 Intr - 162824 162680 145 2 1 -49 103 115 0.232 -1.77 7.03 Intr - 164369 164209 161 0 2 74 93 155 0.670 13.39 7.02 Intr - 166065 165926 140 2 2 60 58 113 0.463 4.69 7.01 Init - 172293 172145 149 0 2 83 59 88 0.142 5.01 7.00 Prom - 186433 186394 40 -4.35 8.03 PlyA - 186451 186446 6 1.05 8.02 Term - 188860 188648 213 1 0 37 44 143 0.565 1.05 8.01 Init - 191760 191623 138 0 0 75 36 126 0.677 6.29 8.00 Prom - 198744 198705 40 -3.65 9.04 PlyA - 198941 198936 6 1.05 9.03 Term - 200485 200223 263 2 2 75 50 109 0.155 0.50 9.02 Intr - 200708 200541 168 0 0 11 86 124 0.087 3.50 9.01 Init - 210465 210339 127 0 1 60 87 98 0.951 7.27 9.00 Prom - 214555 214516 40 -4.85 10.03 PlyA - 215793 215788 6 1.05 10.02 Term - 216259 216073 187 0 1 40 48 154 0.665 2.58 10.01 Init - 217796 217708 89 2 2 61 14 144 0.584 4.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 62857 62636 222 1 0 71 45 314 0.993 21.33 S.002 Init - 63004 62981 24 1 0 67 79 33 0.811 -0.11 S.003 Term - 97284 97063 222 0 0 60 37 157 0.940 3.73 S.004 Init - 97659 97432 228 0 0 88 63 111 0.918 7.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_1|388_aa MPRETHWRRNTQVDGRREECTGVGAHWDASRPSTGGMTQSLAGAVGEELGHQAARLQGKT ISLLAAPSAESCFHSIKPRTHSPSPHVILFFWLWPSHTWCRFRKISQYKCEKMGVKMAHA NPIVNCSCEGSRLRAPYENLMPDDLLLSPITPTWDCLVAGKQARGLPTDSTLCVCPNIWG GALIPVRGHQGGLGRGGGQGTSAGPKGDNIYEWRSTILGPPGSVYEGGVFFLDITFTPEY PFKPPKTLRVTEYSSQSSGRRRDPKPDFQIVPVKITQDDVTVDNGQQELSKASLRSSSEL IRSWNHYCLPSCLYIELELPVLQPAPQLLLITELTHGQPALAVTLVIILSDFTGHVDAAS SNLVSYSLTSPMVSSSLPQPFTTPVVIS >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_1|1167_bp atgccgagagaaacacattggcgaaggaatacacaggtggatggacgtcgagaggaatgc actggtgtaggagcacactgggatgccagcaggccatcgactggtggaatgacacaaagt ttggctggggcagttggagaagagttgggccaccaagcggccagactccaggggaaaacc atttcccttctggctgccccatctgctgagagctgcttccactcaataaaacctcgcact cattctccaagcccacatgtgatcctattcttctggctgtggccctctcacacctggtgt agattcaggaaaatctcacagtacaaatgtgagaaaatgggagtaaaaatggcacacgca aaccctattgtgaactgctcatgtgagggatctaggttgcgtgctccttatgagaatcta atgcctgatgatctgctactgtcacccatcacccccacatgggactgtctagttgcagga aaacaagctcgggggctccccactgattctacattatgcgtctgtcctaacatctgggga ggtgctctgatacctgtccgtgggcaccagggaggcctgggccgcggcggaggacagggc accagtgctggtcccaaaggcgataacatctatgaatggagatcaaccattctagggcct ccaggatccgtgtatgagggtggtgtattctttctcgatatcacttttacaccagaatat cccttcaagcctccaaagacactgcgcgtgacagaatacagcagtcaaagctcggggagg agaagagaccccaaaccagattttcagatagtgcccgtaaagattacacaagacgatgtg acagtagacaatgggcagcaagaactgtcaaaggcctctctgaggagcagcagtgaactc atcagatcttggaaccactactgtcttccttcctgtctgtacattgagcttgagctccct gtgctccagccggccccccaactgctcctcatcacagagctgactcatggtcaacctgct ctcgccgttactctggtcataattcttagtgatttcactggccacgtagatgctgcttcc agtaacctggtctcttattctttgacttctccaatggtctcatcatccttacctcagcca ttcaccactcccgtggtcatatcctag >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_2|143_aa MVKRGSNGNYTFSATDSICTKILQIPLDGDKELAASVLGAERESRRTSTFRMEDCETMED VYMASVETDRGVKEQLHLYDTRGLQEGVELPKHYFSFADGFVLVYSVNNLESFQRVELLK KEIDKFKDKKEASGYVKNAKCEL >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_2|432_bp atggtgaagagaggatctaatggcaactatacgtttagtgccactgactcgatttgcact aagattctccagattcccctggatggggataaagagctggctgccagtgttttgggagca gaaagggaaagccggaggacttcaacattcagaatggaagattgcgaaacaatggaagat gtatacatggcttcagtagaaacagaccgaggagtaaaagaacagttacatctttatgac accagaggtctacaggaaggcgtggagctgccaaagcattatttttcatttgctgatggc ttcgttcttgtgtacagtgtgaataaccttgaatcctttcaaagagtggagcttctgaag aaagaaatcgataagttcaaagacaaaaaagaggcaagtggatatgttaaaaatgccaaa tgtgaattataa >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_3|204_aa MGAYKYIQELWRKKQSDVMRFLLRVRCWQYRQLSALHRAPRPTRPDKARRLGYKAKQGYV IYRIRVRRGGRKRPVPKGATYGKPVHHGVNQLKFARSLQSVAEERAGRHCGALRVLNSYW VGEDSTYKFFEVILIDPFHKAIRRNPDTQWITKPVHKHREMRGLTSAGRKSRGLGKGHKF HHTIGGSRRAAWRRRNTLQLHRYR >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_3|615_bp atgggtgcatacaagtacatccaggagctatggagaaagaagcagtctgatgtcatgcgc tttcttctgagggtccgctgctggcagtaccgccagctctctgctctccacagggctccc cgccccacccggcctgataaagcgcgccgactgggctacaaggccaagcaaggttacgtt atatataggattcgtgttcgccgtggtggccgaaaacgcccagttcctaagggtgcaact tacggcaagcctgtccatcatggtgttaaccagctaaagtttgctcgaagccttcagtcc gttgcagaggagcgagctggacgccactgtggggctctgagagtcctgaattcttactgg gttggtgaagattccacatacaaattttttgaggttatcctcattgatccattccataaa gctatcagaagaaatcctgacacccagtggatcaccaaaccagtccacaagcacagggag atgcgtgggctgacatctgcaggccgaaagagccgtggccttggaaagggccacaagttc caccacactattggtggctctcgccgggcagcttggagaaggcgcaatactctccagctc caccgttaccgctaa >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_4|71_aa MPIKSNAFISFGGVSCRLPGFQKNDTKKKKRKKRKRKEKERKRKKRKKREEEDEEQEEEE EGGGGRRKKKK >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_4|216_bp atgccaattaaatcaaatgctttcatcagttttggaggagtgagttgccgtttgccagga tttcagaagaatgacacaaaaaaaaagaagaggaagaagaggaagaggaaagagaaggag agaaagaggaagaagagaaaaaagagagaagaggaggacgaggagcaagaggaggaggag gaaggaggaggaggaagaagaaagaagaagaaataa >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_5|688_aa MRCTWRIHQWRRAPDKLKLSEELIVEKGPTWVSKSPLSIGDKRALTPQCTGRRTLFSRGD PKARTPRKPNEPAPGEAGSCIPCRFRGGSAGRGRAAGRRRRLGEGVGLSRRDTLEGALRC FDPERLPADWVAPPLEGSENSFQSSSSSVPSSPNSSNSDTNGNPKNGDLANIEGILKNDR IDCSMKTSKSSAPGMTKSHSGVTKFSGMVLLCKVCGDVASGFHYGVHACEGCKGFFRRSI QQNIQYKKCLKNENCSIMRMNRNRCQQCRFKKCLSVGMSRDAVRFGRIPKREKQRMLIEM QSAMKTMMNSQFSGHLQNDTLVEHHEQTALPAQEQLRPKPQLEQENIKSSSPPSSDFAKE EVIGMVTRAHKDTFMYNQEQQENSAESMQPQRGERIPKNMEQYNLNHDHCGNGLSSHFPC SESQQHLNGQFKGRNIMHYPNGHAICIANGHCMNFSNAYTQRVCDRVPIDGFSQNENKNS YLCNTGGRMHLVCPMSKSPYVDPHKSGHEIWEEFSMSFTPAVKEVVEFAKRIPGFRDLSQ HDQVNLLKAGTFEVLMVRFASLFDAKERTVTFLSGKKYSVDDLHSMGAGDLLNSMFEFSE KLNALQLSDEEMSLFTAVVLVSADRSGIENVNSVEALQETLIRALRTLIMKNHPNEASIF TKLLLKLPDLRSLNNMHSEELLAFKVHP >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_5|2067_bp atgcgctgcacttggagaattcaccagtggaggagagctcctgataaactgaagctgagt gaagagttgattgttgaaaagggacccacctgggtctctaagtctcctctaagtattggc gacaagcgggcgctgacaccgcagtgcaccggacgccgcacgctcttttcgcgaggtgac cccaaggcgcggaccccgcgcaaaccaaacgaaccggcgcctggggaggctggtagctgc ataccttgcagattccgaggaggaagtgcaggacgagggcgtgctgcaggccggaggagg cgcctcggggaaggcgtggggctttcccgaagggatacgctcgaaggagctctgaggtgc ttcgatcccgagcgactccccgcagactgggtagcaccgccccttgagggttctgagaat agtttccagtcctcctcctcttctgttccatcttctccaaatagctctaattctgatacc aatggtaatcccaagaatggtgatctcgccaatattgaaggcatcttgaagaatgatcga atagattgttctatgaaaacaagcaaatcgagtgcacctgggatgacaaaaagtcatagt ggtgtgacaaaatttagtggcatggttctactgtgtaaagtctgtggggatgtggcgtca ggattccactatggagttcatgcttgcgaaggctgtaagggtttctttcggagaagtatt caacaaaacatccagtacaagaagtgcctgaagaatgaaaactgttctataatgagaatg aataggaacagatgtcagcaatgtcgcttcaaaaagtgtctgtctgttggaatgtcaaga gatgctgttcggtttggtcgtattcctaagcgtgaaaaacagaggatgctaattgaaatg caaagtgcaatgaagaccatgatgaacagccagttcagtggtcacttgcaaaatgacaca ttagtagaacatcatgaacagacagccttgccagcccaggaacagctgcgacccaagccc caactggagcaagaaaacatcaaaagctcttctcctccatcttctgattttgcaaaggaa gaagtgattggcatggtgaccagagctcacaaggatacctttatgtataatcaagagcag caagaaaactcagctgagagcatgcagccccagagaggagaacggattcccaagaacatg gagcaatataatttaaatcatgatcattgcggcaatgggcttagcagccattttccctgt agtgagagccagcagcatctcaatggacagttcaaagggaggaatataatgcattaccca aatggtcatgccatttgtattgcaaatggacattgtatgaacttctccaatgcttatact caaagagtatgtgatagagttccgatagatggattttctcagaatgagaacaagaatagt tacctgtgcaacactggaggaagaatgcatctggtttgtccaatgagtaagtctccatat gtggatcctcataaatcaggacatgaaatctgggaagaattttcgatgagcttcactcca gcagtgaaagaagtggtggaatttgcaaagcgtattcctgggttcagagatctctctcag catgaccaggtcaaccttttaaaggctgggacttttgaggttttaatggtacggttcgca tcattatttgatgcaaaggaacgtactgtcacctttttaagtggaaagaaatatagtgtg gatgatttacactcaatgggagcaggggatctgctaaactctatgtttgaatttagtgag aagctaaatgccctccaacttagtgatgaagagatgagtttgtttacagctgttgtcctg gtatctgcagatcgatctggaatagaaaacgtcaactctgtggaggctttgcaggaaact ctcattcgtgcactaaggaccttaataatgaaaaaccatccaaatgaggcctctattttt acaaaactgcttctaaagttgccagatcttcgatctttaaacaacatgcactctgaggag ctcttggcctttaaagttcacccttaa >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_6|261_aa MANLKLPMRSQLAHKIPETLAFMSWLRSACSPCYWHSLRPWSKVGAKSWGHEQQQETDGF LGRRRRVPSEAPPSGYRGPECWQLSRQPCRPEWKLVVPFPGPPMAARGPISMHFLLSEAH KIPRLSQSWGKAPLHLTLHSSVYLILPGCRTRTWDPLNGKAKSCNTNRIETCPLLTTLWV KERKAAASPSGTSHLGTPQAKAVIPSLEPCGSWHLQPSGHHCIPRCQLGKLLMVHLVQLQ PRREPAPGDVYPMAAADVSAQ >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_6|786_bp atggccaatctcaagctaccaatgcgaagtcaactggctcacaaaattcctgaaacattg gctttcatgagctggctcagaagtgcctgttccccctgctattggcactcactccgacct tggagcaaagttggggccaagtcctggggtcatgaacagcagcaagagacagacgggttc ctgggcagaaggaggcgggtccccagtgaggccccaccttcaggctacagagggcctgaa tgctggcaactgagccgccagccctgcagaccagagtggaaacttgtggtgccttttcca ggcccacccatggctgcccgtggaccaatcagcatgcacttcctcctctccgaggcccat aaaatccctaggctgagtcagagctggggaaaagctcctcttcatcttaccctccactca tctgtgtacctcattcttcctggttgcaggacaagaacttgggacccactgaatggcaaa gctaaaagttgtaacacaaataggattgaaacatgccccttgctcaccacgctgtgggtg aaggagagaaaagctgcagccagcccttcagggacgtcacacctgggaacgccccaagcc aaggctgtgattccctctttggagccctgtggttcctggcatcttcagccttccggccac cactgcattcccaggtgccagctgggaaagctgctcatggtgcacctggtccagctgcag cctcgcagagagcctgcacctggagatgtctatcccatggcagcagctgatgtgtctgca cagtag >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_7|269_aa MDEAGNHHPRQTNTGTENQTLHVLTHKWELNNENIWTQGGEHHTLEPVGREALTHPQKRS DGGGGPSGVASARTPAAAGEARPGLSALWSQWGPGSGVGSRLVSLSTACRANRVEQDQQA RAKLKQRRRQPQRFLAGEQHPEDFVIAGVKRAPLESTPMKEREERIRQKGIQAFDVGPAA SVNPMVSSGAKWPIRIMPGKECSNLRESVRGKLQILEIQTWAGSYPVAFSDSQAFGFRLN YAAGFPGSPAFRQHITGLLGLQNCVTQFP >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_7|810_bp atggatgaagctggaaaccatcatcctcggcaaactaacacaggaacagaaaaccaaaca ctgcatgttctcactcataagtgggagttgaacaatgagaacatatggacacagggagga gaacatcatacactggagcctgtcgggagagaagcactgacacatccacaaaaacgcagt gatggtggcggcggcccatctggagtggcctctgcgagaacaccagctgcagcaggggag gcgcggccgggactgagcgctctgtggagccagtgggggccaggatcaggcgtgggatcc aggctggtatctctgagcacagcctgcagggccaatcgagtggaacaagaccagcaggcg cgagcaaaactcaagcagaggcgccgccagccacagaggtttctggctggtgaacaacac ccagaggattttgtgatagcaggagtaaagagagcacccttggaatccacacctatgaaa gagagagaggaaaggatcaggcagaagggaatccaagcctttgatgtaggtccagcagcc tcagttaacccaatggtgagctctggagccaaatggcccatccgaattatgccagggaaa gaatgttctaacctccgggaaagtgtacgagggaagctccagattctggaaattcagact tgggctgggagttacccagttgccttctctgattctcaggcctttggattcagactgaac tatgctgctggctttcctggttctccagctttcaggcagcatatcacgggacttcttggc ctccaaaactgtgtgacccaattcccataa >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_8|116_aa MTEGENRPWLRGQFQVLDPGSGSISSDDEIDAMEGIGTGDIELGLQVLQATTPMELPVDR QRKLPAGHMSKPALHPAQESLQILNIRLQPQQEPLSKAPSRGLAHKIMSKIKWFIL >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_8|351_bp atgacagaaggggaaaataggccatggttaagaggacaattccaagttttagatcctggg agtggttccatttcaagtgatgatgagattgatgccatggagggcatagggacaggagac atagagctggggcttcaggtgctccaggcaacaacccccatggagctcccagtggacagg caacgtaaattgccagctggccacatgagtaagccagctttacatccagcccaagaaagc cttcagatacttaacatcagactgcaaccccaacaagaaccactcagcaaagccccttct agaggactggcccacaaaatcatgagcaaaataaaatggtttatactatga >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_9|185_aa MTVDYHRFNQVATLITAAVPDVLYQFASAGKAGNTLSLSCLRENKIPRNPTYKGCEGPLQ GELQTTAQRNKRGHKQMEEHSMLMNRKIQYRENGHTAQELEKTTLKFIWKQKRAHIAKTN LSQKNKAGGITLPDFKLYYKATVTKTAWYWYQTRDIDQWNRTEPSEITPHIYNRLIFDKP DKNKK >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_9|558_bp atgacggtggattatcataggtttaaccaggtagcaactttaattacagctgccgtacca gatgtgctgtaccagtttgcttcagctggcaaggctggcaatacactttcactgtcctgc ctcagagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaa ggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacat tccatgctcatgaataggaagattcaatatcgtgaaaatggccatactgcccaagaattg gaaaaaactactttaaagttcatatggaaacaaaaaagagcccacattgccaagacaaac ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaag gctacagtaaccaaaacagcatggtactggtaccaaaccagagatatagaccaatggaac agaacagagccctcagaaataacaccacacatctacaaccgtctgatctttgacaaacct gacaaaaacaagaaatag >gi568815595f:23854535_24077416|GENSCAN_predicted_peptide_10|91_aa MKDSKAGKEDGEMRAEKYAEELWGPEGEEHTFHNSKDIGSTYMPVNGGLDKENVVHIHHG ILHSNTRDEIMSLAAKQMQLEAIVLRKLMQE >gi568815595f:23854535_24077416|GENSCAN_predicted_CDS_10|276_bp atgaaggacagcaaggcaggaaaagaggacggtgagatgagggcagagaagtatgctgaa gagctctggggaccagagggtgaagagcacacttttcacaatagcaaagacataggatca acctacatgcccgtcaacggtggactggataaagaaaatgtggtacatatacaccatgga atactacacagcaatacaagagatgaaatcatgtcccttgcagcaaaacagatgcagctg gaggcgattgtcctaagaaaattgatgcaggaatga