GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:07:34 Sequence gi568815591f:55812676_56040003 : 227328 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 3647 3642 6 1.05 1.05 Term - 14607 14554 54 0 0 105 55 31 0.218 -0.94 1.04 Intr - 21911 21750 162 2 0 120 84 55 0.918 8.47 1.03 Intr - 30453 30267 187 2 1 55 95 103 0.962 7.39 1.02 Intr - 32043 31848 196 1 1 96 109 77 0.972 9.07 1.01 Init - 32563 32479 85 1 1 78 89 -6 0.392 -0.45 1.00 Prom - 34167 34128 40 -4.76 2.00 Prom + 38780 38819 40 -4.46 2.01 Init + 39901 40065 165 0 0 61 59 128 0.910 6.83 2.02 Term + 45406 45432 27 0 0 82 49 45 0.756 -1.93 2.03 PlyA + 45627 45632 6 1.05 3.00 Prom + 57955 57994 40 -1.86 3.01 Init + 59184 59291 108 2 0 77 39 96 0.236 3.82 3.02 Intr + 62483 62574 92 1 2 77 93 -12 0.152 -3.01 3.03 Intr + 65067 65111 45 0 0 70 83 84 0.887 3.62 3.04 Intr + 65276 65476 201 2 0 -12 89 204 0.931 8.90 3.05 Intr + 66414 66445 32 0 2 91 37 33 0.334 -3.73 3.06 Intr + 74609 74823 215 0 2 108 41 172 0.290 12.93 3.07 Intr + 74886 75005 120 2 0 34 105 113 0.728 8.29 3.08 Term + 75307 75324 18 0 0 128 49 -3 0.727 -1.88 3.09 PlyA + 76374 76379 6 1.05 4.00 Prom + 79049 79088 40 -0.36 4.01 Init + 108923 109159 237 1 0 92 43 163 0.824 10.21 4.02 Intr + 110487 110613 127 2 1 43 68 162 0.531 9.95 4.03 Intr + 110932 111024 93 2 0 38 111 52 0.667 2.44 4.04 Term + 126307 127331 1025 2 2 44 32 391 0.219 21.38 4.05 PlyA + 128805 128810 6 1.05 5.00 Prom + 129988 130027 40 -5.06 5.01 Init + 133147 133196 50 0 2 69 34 71 0.243 0.22 5.02 Intr + 139141 139255 115 1 1 -7 105 105 0.431 3.15 5.03 Term + 142234 142503 270 0 0 94 48 161 0.765 8.18 5.04 PlyA + 143806 143811 6 1.05 6.00 Prom + 147219 147258 40 -1.76 6.01 Init + 151935 152026 92 2 2 99 105 173 0.999 18.06 6.02 Intr + 165451 165590 140 1 2 107 65 72 0.436 6.91 6.03 Term + 188399 188529 131 0 2 0 54 117 0.037 -2.16 6.04 PlyA + 190020 190025 6 1.05 7.04 PlyA - 190034 190029 6 1.05 7.03 Term - 192388 192245 144 1 0 81 47 180 0.996 11.11 7.02 Intr - 194141 194050 92 0 2 89 63 21 0.734 -0.59 7.01 Init - 195303 195192 112 0 1 71 29 73 0.385 0.07 7.00 Prom - 195532 195493 40 -4.86 8.07 PlyA - 197760 197755 6 1.05 8.06 Term - 199194 199087 108 0 0 121 38 137 0.999 10.51 8.05 Intr - 202496 202348 149 0 2 109 34 103 0.992 6.95 8.04 Intr - 204704 204559 146 1 2 80 107 12 0.933 2.23 8.03 Intr - 207059 206925 135 1 0 100 116 66 0.994 10.28 8.02 Intr - 208556 208398 159 1 0 93 23 145 0.818 7.60 8.01 Intr - 221503 221286 218 1 2 59 110 123 0.499 8.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 57012 56910 103 0 1 60 68 115 0.974 5.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_1|227_aa MASSSIHVPAKDMISFFFMAENMCHVLTWETGIGKSTLIDTLFNTNLKDNKSSHFYSNVG LQIQTYELQESNVQLKLTVVETVGYGDQIDKEASYQPIVDYIDAQFEAYLQEELKIKRSL FEYHDSRVHVCLYFISPTGHSLKSLDLLTMKNLDSKVNIIPLIAKADTISKNDLQTFKNK IMSELISNGIQIYQLPTDEETAAQANSSVSERCSFVYKGEAASPFST >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_1|684_bp atggcctccagctccatccatgttcctgcaaaggacatgatctccttcttttttatggct gagaacatgtgccatgttctcacttgggagactggaattggaaaatcgacactgatagac acattgtttaatactaacttgaaagataacaaatcctcacatttttactcaaatgttgga cttcaaattcagacatatgaacttcaggaaagcaatgttcagttgaaattgactgttgtg gagacagtagggtatggtgatcaaatagacaaagaagccagctaccaaccaatagttgac tacatagatgcccaatttgaggcctatcttcaagaagaactgaagattaaacgttccttg tttgagtaccatgattctcgcgtccacgtgtgtctttacttcatttcacctacaggacat tccctgaagtctcttgatctattaacaatgaagaaccttgacagtaaggtgaatattata ccactgattgccaaagcagacactatttctaaaaatgatttacagacgtttaagaataag ataatgagtgaattgattagcaatggcatccagatatatcagctcccaacagatgaagaa actgctgctcaagcgaactcctcagttagtgaacgctgctcctttgtgtacaaaggggaa gcagcttcacctttcagcacatga >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_2|63_aa MNLLKEKIGETFQDIDLGKYFLDNAPQAQETKAKMDKWDHMKLKSFYTAKKVINKNKKAF IYG >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_2|192_bp atgaacctactaaaagaaaagattggggaaactttccaggacattgacctgggcaaatat ttcttggataatgccccacaagcacaggaaaccaaagcaaaaatggacaaatgggatcat atgaaattaaaaagcttctacacagcaaagaaagtgatcaacaaaaataaaaaagccttc atctatgggtga >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_3|276_aa MDNYHTERIAGEIDIPEMSGDGKEQRRAEAISISHRAVCCMLYAVCSSSVAGGRGAGLLS VAAVLGSPAAEQFFPEYENPERDDPSTIEKLSKNKQKPITPETAEKLAHDLKIVKYVECS ALTQKGLKNVFDEATLAALEPPESKKSHRVDPVFDETHERPENSGKARGRGPRGPHSPDE KQTRESWWESGAPAMRIAMKEAGSERRQRMRIGDLRPASCAAIAGRLSSEQAAGPSAPSP SRVHRGGGGGGGGGGGGVRGRSLPKRPLSAEIFGLL >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_3|831_bp atggataattaccacacagaaagaattgctggagagattgacatacctgagatgtctggg gatggcaaggaacaaagaagggcagaagctatcagtatcagccacagagctgtatgctgt atgctctatgcagtatgcagcagttctgttgcagggggtcggggagcagggttgctgtca gtggcggcagtcctaggcagtcctgcagcagaacagttcttccctgaatacgagaaccca gaaagagatgatccctctactattgagaaactttccaagaacaaacagaagcctatcact ccagagactgctgaaaagctggcccatgacctgaagattgtcaagtatgtggagtgttct gcactcacacagaaaggcctaaagaatgtatttgacgaagcaacattggctgcgctggag cctccagaatcgaagaagagccacagagtagatccagtatttgatgaaactcatgaaaga ccggaaaattctgggaaggctcgaggccgcggtccccggggtccgcatagtcccgatgaa aaacagacccgggaaagctggtgggagtcaggagccccggcgatgaggattgcgatgaag gaagcaggctcggagcgccgccagcgcatgcgtattggggatctgaggccagcgtcttgc gccgccattgcggggaggctgtcctcagagcaggctgcggggccctcggcaccttctccc tcccgggtccaccgcggcggcggcggcggcggcggcggcggcggcggcggcgtcaggggg cggagcctgccgaagcgccctttgtctgcggagatttttggtcttttatag >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_4|493_aa MVEIARELELEVQPEDVVELLLKDEEKKWFLEMESVPGEDALNNVEMTIVDLEYSINLLD KAAGGFEKTDSNFERSSTVESLTFQDVAVDFTREEWDQLYPAQKNLYRDVMLENYRNLVA LGYQLCKPEVIAQLELEEEWVIERDSLLDTHPDGENRPEIKKSTTSQNISDENQTHEMIM ERLAGDSFWYSILGGLWDFDYHPEFNQENHKRYLGQVTLTHKKITQERSLECNKFAENCN LNSNLMQQRIPSIKIPLNSDTQGNSIKHNSDLIYYQGNYVRETPYEYSECGKIFNQHILL TDHIHTAEKPSECGKAFSHTSSLSQPQMLLTGEKPYKCDECGKRFSQRIHLIQHQRIHTG EKPFICNGCGKAFRQHSSFTQHLRIHTGEKPYKCNQCGKAFSRITSLTEHHRLHTGEKPY ECGFCGKAFSQRTHLNQHERTHTGEKPYKCNECGKAFSQSAHLNQHRKIHTREKLCEYKC EQTVRHSPSFSST >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_4|1482_bp atggtggaaatagcaagagaactagaattagaagtgcagcctgaagatgtggtggaattg ctgcttaaggatgaggaaaaaaagtggtttcttgagatggaatcagttcctggtgaagat gctttgaataatgtggaaatgacaatagtggatttagaatattccataaacttattagat aaagcagcaggaggatttgaaaagactgactccaattttgaaagaagttctactgtggaa tcactgacgtttcaggatgtggccgtggacttcaccagagaggagtgggaccagctgtac cctgcccaaaagaacctctatcgagacgtgatgctggagaactacaggaatctagttgca ctggggtatcagctttgtaagccagaggtaatcgcgcagttggagctagaggaagaatgg gtgatagaaagagacagcctgctggatactcatccagatggagaaaacagacccgaaatc aaaaagtcaaccacaagccagaatatttctgatgaaaatcaaacccatgagatgataatg gagagactcgcaggagacagcttctggtactccatcctaggaggactctgggattttgat taccatccagagtttaaccaagaaaaccacaagagatatttaggacaagtaactttgacc cacaaaaagatcacacaggagagaagccttgagtgtaataaatttgcagaaaactgtaat ctgaactcaaaccttatgcagcagagaattccttccattaaaatacccctgaattctgac acacagggaaacagcatcaaacataattcagacttgatttactatcagggaaattatgta agagagactccctatgaatatagtgagtgtggaaaaatcttcaatcaacatattcttctt actgatcatattcatactgcagagaaacccagtgagtgtgggaaggccttcagccacacc tcatctcttagccagcctcagatgttgcttacaggagagaagccctataagtgtgatgaa tgtggaaaaagattcagccagaggatacatctcattcaacatcagagaattcacacagga gaaaagccttttatatgcaatggatgtgggaaagccttccgtcagcattcatcctttact caacatctgaggattcatactggagaaaagccctataaatgtaatcaatgtggtaaagct tttagccgcatcacatcccttactgaacatcatagacttcataccggagagaaaccttac gaatgtggtttctgtggcaaagccttcagtcagaggacacatctgaatcaacatgaaaga actcatacaggagagaaaccctataaatgtaatgaatgcgggaaagcctttagccagagt gcacaccttaatcaacacaggaaaatccatactcgggagaaattatgtgaatataaatgt gagcaaactgttcgccacagtccttcatttagcagcacataa >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_5|144_aa MEFSEEDEQNLDIPEKRRCTAYERGSVGLSQCVRRDAIPLRPALPGAGGCIVLAEYFNKR KTYFAHDALQQCTVGDIVLLRALPVPRAKHVKHELAEIVFKVGKVIDPVTGKPCAGTTYL ESPLSSETTQLSKNLEELNISSAQ >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_5|435_bp atggaattctcagaagaggatgagcagaacttggacatcccagaaaaaaggagatgcacg gcctacgagaggggtagtgtgggactgagccaatgcgtacggcgtgacgccatcccgctg cgaccggcgctccccggggcaggtggctgcatagtcttggcggagtattttaataagcgg aaaacctactttgctcacgatgcccttcagcagtgcacagttggggatattgtgcttctc agagctttacctgttccacgagcaaagcatgtgaaacatgaactggctgagatcgttttc aaagttggaaaagtcatagatccagtgacaggaaagccctgtgctggaactacctacctg gagagtccgttgagttcggaaaccacccagctaagcaaaaatctggaagaactcaatatc tcttcagcacagtga >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_6|120_aa MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA HSNLLAKKETSNLYKLQLRLEELTKGVTVVRGEDQDEEEPAKETKKPPGNSADTQASEAS >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_6|363_bp atggcggcgcgagtgctgcgcgcccgcggagcggcctgggccggcggcctcctgcagcgg gcggccccctgcagcctcctgcccaggctccggacatggacatcttccagcaacagatct cgagaagacagctggctaaaatccttatttgtccggaaagttgatccaagaaaagatgcc cactccaatctcctagccaaaaaggaaacaagcaatctatacaaattacagttgagacta gaggagctcaccaagggagtgactgtagtcagaggagaggatcaagatgaggaggaacct gcaaaagagactaagaaaccaccagggaactcagcagacactcaagcaagtgaggcgtca tga >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_7|115_aa MADCPCDAIVHSDYEHRLRKQKELRIGIMKPRAEIFQAGKYPTRVHDSCEITFRAGTRGN DFHTILYLSLNVVDPEVCLTQLPPGGHLPVGQLHAACSAGPTDPHTDTDCADMLQ >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_7|348_bp atggctgattgtccgtgtgacgccattgttcattctgattatgaacaccgcttacgtaaa cagaaagagctaagaattgggatcatgaagccaagagcagaaatttttcaggctgggaaa taccctaccagagttcatgacagctgtgagattacattcagagctggaacccgaggcaat gatttccacaccattctttatctttccctcaatgtggtggatccagaagtctgccttaca caactgccccctggtggccaccttcccgtgggacagttacatgcagcctgttcagcaggc cccactgaccctcacaccgacacggactgtgcagatatgctgcagtga >gi568815591f:55812676_56040003|GENSCAN_predicted_peptide_8|304_aa SRQIWCLQFTEALAELIRSHPAYQQISRERSVEALGLHSSPPWELLARASEPALAPCSRK VAAAGAPEAQDTGRKILPAMVSHSELRKLFYSADAVCFDVDSTVIREEGIDELAKICGVE DAVSEMTRRAMGGAVPFKAALTERLALIQPSREQVQRLIAEQPPHLTPGIRELVSRLQER NVQVFLISGGFRSIVEHVASKLNIPATNVFANRLKFYFNGEYAGFDETQPTAESGGKGKV IKLLKEKFHFKKIIMIGDGATDMEACPPADAFIGFGGNVIRQQVKDNAKWYITDFVELLG ELEE >gi568815591f:55812676_56040003|GENSCAN_predicted_CDS_8|915_bp agcagacagatctggtgtctccagtttacagaagcacttgcagaactcatcagaagccac cccgcttatcagcagatctcaagagagcgttcggtggaggccctgggtctgcacagctca cctccctgggaactgctcgcccgagcgtcggagccggcgctggccccctgcagccggaag gttgcagccgcaggagccccggaggcccaggacacagggaggaaaattcttccagcgatg gtctcccactcagagctgaggaagcttttctactcagcagatgctgtgtgttttgatgtt gacagcacggtcatcagagaagaaggaatcgatgagctagccaaaatctgtggcgttgag gacgcggtgtcagaaatgacacggcgagccatgggcggggcagtgcctttcaaagctgct ctcacagagcgcttagccctcatccagccctccagggagcaggtgcagagactcatagca gagcaacccccacacctgacccccggcataagggagctggtaagtcgcctacaggagcga aatgttcaggttttcctaatatctggtggctttaggagtattgtagagcatgttgcttca aagctcaatatcccagcaaccaatgtatttgccaataggctgaaattctactttaacggt gaatatgcaggttttgatgagacgcagccaacagctgaatctggtggaaaaggaaaagtg attaaacttttaaaggaaaaatttcattttaagaaaataatcatgattggagatggtgcc acagatatggaagcctgtcctcctgctgatgctttcattggatttggaggaaatgtgatc aggcaacaagtcaaggataacgccaaatggtatatcactgattttgtagagctgctggga gaactggaagaataa