GENSCAN 1.0    Date run: 4-Nov-116    Time: 08:46:55

Sequence gi568815592r:34778611_34987957 : 209347 bp : 42.83% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  13515  13558   44  2  2  110   78   107 0.536  10.23
 1.02 Intr +  36409  36540  132  1  0   94   32   146 0.906   8.44
 1.03 Intr +  43043  43205  163  2  1   93   80   161 0.861  14.86
 1.04 Intr +  44647  44738   92  0  2   79   88    85 0.996   5.47
 1.05 Intr +  55607  55796  190  2  1   77   46   287 0.999  22.07
 1.06 Intr +  56111  56251  141  1  0  109   89    86 0.995  10.43
 1.07 Intr +  56684  56854  171  1  0   91   66   206 0.964  17.92
 1.08 Intr +  57546  57738  193  2  1   76   80   206 0.942  16.74
 1.09 Intr +  61267  61458  192  2  0   77   49   137 0.868   7.24
 1.10 Intr +  73407  73605  199  1  1   68  110    56 0.385   3.29
 1.11 Intr +  73975  74075  101  1  2  161   43   -14 0.485   0.23
 1.12 Intr +  77007  77093   87  1  0  115   66    74 0.655   6.92
 1.13 Intr +  77629  77799  171  2  0  104   94    90 0.991  10.19
 1.14 Intr +  78180  78319  140  1  2   73   94   111 0.997   9.46
 1.15 Intr +  78719  78860  142  1  1   17  115    91 0.634   3.71
 1.16 Intr +  79114  79307  194  2  2  101  100   150 0.996  15.79
 1.17 Intr +  79802  80961 1160  1  2   99  108   973 0.503  87.15
 1.18 Intr +  85392  85588  197  0  2   71   84   249 0.919  20.94
 1.19 Intr +  88615  88769  155  2  2  103   65   108 0.978   8.87
 1.20 Intr +  88856  89006  151  1  1   40  110   192 0.980  15.41
 1.21 Intr +  92244  92506  263  1  2   69   70   232 0.997  15.78
 1.22 Intr +  92974  93082  109  0  1  110  101    89 0.957  11.34
 1.23 Intr +  93209  93315  107  0  2   69   94   103 0.998   8.01
 1.24 Term +  93703  93828  126  0  0  110   47   118 0.978   7.20
 1.25 PlyA +  98732  98737    6                               1.05

 2.10 PlyA -  98963  98958    6                               1.05
 2.09 Term - 100110  99980  131  1  2   72   41    42 0.378  -4.74
 2.08 Intr - 101453 101357   97  2  1   58  113   104 0.886   8.56
 2.07 Intr - 101766 101679   88  2  1   52   86   100 0.999   5.25
 2.06 Intr - 104470 104322  149  1  2   75   92   154 0.996  12.61
 2.05 Intr - 109387 109177  211  0  1   43   75   362 0.002  28.39
 2.04 Intr - 110981 110838  144  1  0    5   34   263 0.001  11.08
 2.03 Intr - 111205 111142   64  2  1   74   37   137 0.002   4.06
 2.02 Intr - 112860 112832   29  2  2   64  110    20 0.002  -1.36
 2.01 Init - 121280 121060  221  2  2   74   97    96 0.326   7.25
 2.00 Prom - 125235 125196   40                              -7.25

 3.00 Prom + 127984 128023   40                              -5.75
 3.01 Init + 128756 128929  174  1  0   45  103    86 0.778   5.19
 3.02 Intr + 136628 136645   18  1  0  121   74    26 0.196   0.39
 3.03 Intr + 138562 138745  184  0  1   12   87   168 0.172   7.54
 3.04 Intr + 146699 146786   88  0  1   92   42    52 0.034  -0.89
 3.05 Intr + 148110 148242  133  0  1  103  101    72 0.104   9.73
 3.06 Intr + 161469 161591  123  2  0   80   59    47 0.010   0.86
 3.07 Intr + 173508 173521   14  2  2   81  106    40 0.000  -2.24
 3.08 Intr + 188629 188709   81  1  0   91   97    77 0.413   6.73
 3.09 Intr + 191400 191556  157  0  1  109   52   155 0.993  13.09
 3.10 Intr + 193971 194033   63  2  0  117   93    85 0.994   9.90
 3.11 Intr + 203080 203376  297  0  0   98   69   375 0.806  32.75
 3.12 Intr + 204142 204217   76  0  1  117   94    76 0.988   9.27
 3.13 Intr + 204503 204604  102  0  0   97   64   125 0.988  10.23
 3.14 Intr + 204714 204815  102  1  0   66   86    64 0.903   3.23
 3.15 Intr + 206472 206725  254  1  2  140   49   253 0.365  22.83
 3.16 Intr + 208474 208594  121  0  1   66   80    81 0.309   4.25

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:34778611_34987957|GENSCAN_predicted_peptide_1|1539_aa
MAAAAAVSGAHAAARANCFISEPHALPLSNGDNNAYLMWDIAKINSDVMKTLRSVPGAKF
TKNLSPDKINLSTLKGEGQLTNLELDEEVLQNVLELPTWLAITRVYCNRASIRCLDKVEV
EMKTCEDPRPPNGQSPIALASGQSEYGFAEKVVEGMFIIVNSITIKIHSKAFHASFELWQ
LQGYSVNPNWQQSDLRLTRITDPCRGEVLTFKEITWQTLRIEADATDNGDQDPVTTPLRL
ITNQGRIQIALKRRTKDCNVISSKLMFLLDDLLWVLTDSQLKAMMKYAESLSEAMEKSAH
QRKSLAPEPVQITPPAPSAQQSWAQAFGGSQGNSNSSSSRLSQYFEKFDVKESSYHLLIS
RLDLHICDDSQSREPGVNSFTLSGRQRLYKSCAMPCCAVPSKLSPKSIASAGQLHLLQTI
VVQGQGMTFLCYGYMAVITGSGPLSGPGQVQRCRLGAKAWNRGPQEPALYPTPLWPSWYP
AARQSPLFSSVFFPQAGGVSPCGHHRCKEAVSPRAANCAAWGWGRGDASIPLAIPVGISG
VSANRLMGGAMQLTFRKMAFDYYPFHWAGDSCKHWVRHCEAMETRGQWAQKLVMEFQSKM
EKWHEETGLKPPWHLGVDSLFRRKADSLSSPRKNPLERSPSQGRQPAFQPPAWNRLRSSC
MVVRVDDLDIHQVSTAGQPSKKPSTLLSCSRKLHNLPTQVSAIHIEFTEYYFPDNQELPV
PCPNLYIQLNGLTFTMDPVSLLWGNLFCLDLYRSLEQFKAIYKLEDSSQKDEHLDIRLDA
FWLKRPKASWDLWSVHFTQISLDFEGTENFKGHTLNFVAPFPLSIWACLPLRWQQAQARK
LLLASEGRLKPSASFGSPVQSEALAPDSMSHPRSKTEHDLKSLSGLTEVMEILKEGSSGM
DNKGPLTELEDVADVHMLVHSPAHVRVRLDHYQYLALLRLKEVLQRLQEQLTKDTESMTG
SPLQNQTACIGVLFPSAEVALLMHPAPGAVDADSAGSDSTSLVDSELSPSEDRELKSDAS
SDQGPASPEKVLEESSIENQDVSQERPHSNGELQDSGPLAQQLAGKGHEAVESLQAKKLS
RTQASSSPAALKPPAGRETAVNGQGELIPLKNIEGELSSAIHMTKDATKEALHATMDLTK
EAVSLTKDAFSLGRDRMTSTMHKMLSLPPAKEPMAKTDEGVAAPVSGGAARLRFFSMKRT
VSQQSFDGVSLDSSGPEDRISVDSDGSDSFVMLLESESGPESVPPGSLSNVSDNAGVQGS
PLVNNYGQGSPAANSSVSPSGEDLIFHPVSVLVLKVNEVSFGIEVRGEDLTVALQAEELT
LQQLGTVGLWQFLHGQCPGTCFQESSTLKTGHIRPAVGLRFEVGPGAAVHSPLASQNGFL
HLLLHGCDLELLTSVLSGLGPFLEDEEIPVVVPMQIELLNSSITLKDDIPPIYPTSPGPI
PITLAMEHVVLKRSDDGVFHIGAAAQDKPSAEVLKSEKRQPPKEQVFLVPTGEVFEQQVK
ELPILQKELIETKQALANANQDKEKLLQEIRKYNPFFEL

>gi568815592r:34778611_34987957|GENSCAN_predicted_CDS_1|4620_bp
atggcggcggcggcggctgtgtccggtgctcacgccgcggcgagggcaaactgcttcata
tccgaacctcatgctcttcctctgagcaatggggataataatgcatacctcatgtgggat
attgcaaagattaattcagatgttatgaagacccttcgttcagtgcctggtgcaaagttc
actaagaatctttccccagacaaaatcaacctgagcaccctgaaaggggagggtcagctg
accaacctggagctggatgaagaggttctacagaatgtactggagctgcccacctggtta
gccatcactcgggtctactgcaacagggcctccatccggtgtctggataaggtagaggtg
gagatgaagacatgtgaggatcctcggccccccaatggacagtctcccattgcccttgct
tcaggacagagtgaatatggctttgccgaaaaggtggtggaagggatgttcatcattgtc
aattctatcaccatcaagattcactccaaggccttccacgcttcttttgaattgtggcag
ctccagggctatagtgtcaaccccaactggcagcagagtgaccttcgccttacccgcatc
actgacccctgccgaggagaggttttaacatttaaggaaataacttggcaaacactccga
attgaggcagatgctacagacaatggtgatcaggacccagtcaccactccattgaggctt
attacgaaccaaggcaggatccaaatagccctcaaaagaagaaccaaagattgcaatgtg
atatcctccaagctgatgttcctgttggatgacctgctctgggtgctgactgactcacag
ctcaaggctatgatgaagtatgcagagtcactgagtgaagccatggagaagtcagcccat
caaagaaagagcctggcccctgaacctgtgcagatcactccaccagcccccagtgcccag
cagtcctgggcccaggcatttggtggcagccagggcaacagcaacagcagcagcagccgc
ctcagccagtactttgagaaatttgatgtgaaagagtcctcctaccatctgctcatctcc
cgcctggacctgcacatttgtgatgatagccagtcccgagagccaggggtcaatagcttc
actctgtctgggcgccagcgcctgtacaagagctgtgccatgccgtgctgtgccgtgccg
tccaagctgagccccaagagcatagcatcagcagggcagttacaccttttacagacaata
gtggtacagggccaagggatgaccttcctatgttatggctacatggctgtgataacaggc
agtgggcccctctctggcccagggcaggtccagagatgtcgtctaggagctaaagcctgg
aatcggggaccccaggagcctgctttgtaccctaccccactatggccaagctggtaccca
gctgcaagacaaagtccccttttctcttctgtcttctttcctcaagcaggaggagtctct
ccttgtggtcaccacaggtgcaaggaagcagtctctcccagagctgcaaactgtgctgca
tggggttgggggaggggtgatgcaagcattcctttggctatcccagttggcatctcaggt
gtctctgccaacagactcatgggtggtgccatgcagcttaccttccgcaagatggcgttt
gactattaccctttccattgggcaggtgatagctgcaaacattgggtacgccactgtgag
gccatggagacccgaggccagtgggcccagaagctggtgatggaatttcagagcaaaatg
gagaagtggcatgaagagacgggtctgaaaccaccctggcaccttggagtagactctctc
tttcggagaaaagcagattctctttccagtcctcgaaagaaccctcttgagagaagcccc
tctcagggcagacagcctgcctttcagcctccagcatggaaccgcttacgctctagctgc
atggtggtacgggtggatgacctggacatccaccaggtttccaccgctggacagccaagt
aaaaagccatctacactcctttcctgcagtcggaaacttcacaacctccctacccaggtc
tctgccattcatattgagttcacagagtattacttcccagataatcaggagcttccagtt
ccttgtcctaatctctacattcagttaaatggtctgacatttactatggatcctgtcagt
ttgctctggggaaacctcttttgcctggatttataccgcagcttggagcagttcaaagct
atctacaagctggaagattcaagtcagaaagatgaacacttggacatccgactagatgca
ttctggttgaagagacctaaggcttcctgggatctctggtctgtccactttacccagatc
tccttggactttgagggaacagaaaacttcaaaggccataccttgaattttgtagccccc
ttccccctgtccatttgggcctgcctacccctccgctggcagcaagcccaggcacggaag
cttcttttggcctcagaggggaggctgaaaccatcagccagttttggaagtcctgtccag
tctgaggctcttgcccctgactctatgtcccatccgcggtcaaagactgaacatgacttg
aaaagcttatcaggacttacagaagtcatggaaattctgaaagaaggcagtagtggtatg
gacaacaaagggcctctgacagagctggaggatgtagcagatgttcatatgcttgtacat
tccccggcccatgtccgcgtgaggcttgaccactaccagtacttggctctgcttcgcctg
aaggaggtgctgcagaggcttcaggagcagctgactaaggatacagagtcaatgactggg
tctcccctgcagaatcagacagcttgcattggagttctctttcccagtgctgaagtggct
ctgcttatgcatcctgcacccggtgctgtcgatgctgactctgcaggctcagatagcact
agcctcgtagattcagagctatctccttcagaggatcgggaactgaagtctgatgcctca
tcagaccagggcccagcaagccctgagaaggtcttggaggaaagtagcattgaaaatcag
gatgtatcccaggagaggccacatagcaatggagaactgcaggactcaggtccacttgcc
cagcagctggcagggaagggccatgaggcagtagagtccctacaggccaagaaactgagc
agaacccaagcctccagctcaccagctgcattgaagcccccagctggcagggagactgct
gtgaatggacagggtgagctcatccccttgaagaacattgagggagaattgtcaagtgct
attcacatgaccaaggatgccaccaaggaggctctacatgccaccatggacctcaccaag
gaagctgtgtccctgactaaggatgccttcagtttgggcagagatcgaatgacctccacc
atgcacaagatgttgtccctgcccccagccaaggagcccatggccaagacagatgagggg
gtggcagccccagtgagtggaggtgctgcacgactccgatttttctccatgaagaggacg
gtatctcaacagtcatttgatggtgtctcattggatagcagtggccctgaagaccggatt
tcagtggacagtgatggcagtgatagctttgtgatgctcttggagtctgagtctggtcca
gaatctgttccaccaggatctctttcaaatgtctcagataatgctggtgttcaagggagc
cctcttgtgaataattatggccaggggtcaccagcagccaacagttcagtttcacccagt
ggagaagacctcatctttcacccggtctcagttctggtcctgaaggtgaatgaggtgtct
tttgggattgaggtacgtggtgaggacctgactgtggccctgcaagcagaggaactgacc
ctccagcagctgggcaccgtgggactctggcagttcctgcatggacagtgcccaggtaca
tgctttcaggaatcctcaactttgaagactggccacatcaggccagctgtgggccttcgc
tttgaggtggggcctggagcagctgttcattcccccctggcctcacaaaatggcttccta
catttattgcttcatggctgtgacctcgagctgctcacttcagtgctcagtggcctgggg
cccttcttggaggatgaggagatcccggtggtagtccccatgcagattgagcttctgaac
tccagcatcaccctaaaggatgatatcccccccatctatccaacatctccaggccccatc
cccatcactctggccatggaacatgttgtgctgaagaggagtgatgatggtgtgttccac
ataggcgctgctgctcaggacaaaccatcagctgaagtacttaaaagtgagaagagacag
cccccaaaagaacaggtgtttttggtgcccacaggagaggtttttgaacagcaggtgaaa
gaactgcctatcctacaaaaagaacttatagaaactaaacaagccttggccaatgccaac
caggataaagaaaaacttcttcaggagattaggaaatataaccccttctttgagctctga

>gi568815592r:34778611_34987957|GENSCAN_predicted_peptide_2|377_aa
MAAGQPEKARIRRLAPLRTKANAQTPPGWIRGRVVSMMWIVMSKAKTGAPPQVQQRVKVL
KTTSESLRTFQTLRHQNCRVANPAGSERETRRARSSVDSDLRALRLERGWLEEPRPPPPP
PLPPPPPPEPPPPPPPKPEESRFPDSSFSTAGRHLRDLLSPPILSVMDDAHESPSDKGGE
TGESDETAAVPGDPGATDTDGIPEETDGDADVDLKEAAAEEGELESQDVSDLTTVEREDS
SLLNPAAKKLKIDTKEKKEKKQKVDEDEIQKMQILVSSFSEEQLNRYEMYRRSAFPKAAI
KRLIQSITGTSVSQNVVIAMSGISKVFVGEVVEEALDVCEKWGEMPPLQPKHMREAVRRL
KSKGQIPNSKHKKIIFF

>gi568815592r:34778611_34987957|GENSCAN_predicted_CDS_2|1134_bp
atggctgcagggcaacctgaaaaagcaagaatcaggcggctggccccattgaggacaaaa
gccaatgcacagactccaccaggctggatcagaggaagagtggtgagcatgatgtggata
gttatgagcaaagccaagactggggctcctccccaggtacagcaaagagttaaagttcta
aaaaccacatcagaaagtctaagaacctttcagacactcagacatcagaactgcagagtg
gcaaatccagccgggtcagaacgggagaccaggcgagcccggagcagcgtggactcggac
ctgcgcgccctccgactggagagggggtggctggaagagccgaggccgccgccgccgccg
ccgctgccgccgccgccgcccccagagccaccgccgccgccgcccccaaagcctgaggag
agccgcttcccggacagcagcttctccaccgccgggaggcatctccgcgatctcctctcc
cctccaatcctatccgtgatggacgatgcccacgagtcgccctccgacaaaggtggagag
acaggggagtcggatgagacggccgctgtgcccggggacccgggggctaccgacaccgat
ggaatcccagaggaaactgacggagacgcagatgtggacttgaaagaagctgcagcggag
gaaggcgagctcgagagtcaggatgtctcagatttaacaacagttgaaagggaagactca
tcattacttaatcctgcagccaaaaaactgaaaatagataccaaagaaaagaaagagaaa
aagcagaaagtagatgaagatgagattcagaagatgcaaatcctggtttcttctttttct
gaggagcagctgaaccgttatgaaatgtatcgccgctcagctttccctaaggcagccatc
aaaaggctgatccagtccatcactggcacctctgtgtctcagaatgttgttattgctatg
tctggtatttccaaggttttcgtcggggaggtggtagaagaagcactggatgtgtgtgag
aagtggggagaaatgccaccactacaacccaaacatatgagggaagccgttagaaggtta
aagtcaaaaggacagatccctaactcgaagcacaaaaaaatcatcttcttctag

>gi568815592r:34778611_34987957|GENSCAN_predicted_peptide_3|663_aa
MWGIGSRTPIYNKILVYSSPEVSCEKPKYVKSQTSTHTGFESHKRYIFYLRLVGKNLHVS
AMGLGKSQERRKKALKSQPQQPADSFPSLPTVEGLTTSGEVAAGFWLSGEALSSCVVCGE
AALTQVSAHRMAFLKSYVADDIKGEGKESLVGPWLRRKALFSFAAEHQSQSLIGGLSLLP
VISYRILDSVQLWGLAVGNAKNSYKISSGFKNRVYFALSDEYIVSPKLRFHIVSLGKQLL
LCDPGMWRGPNVNCVDSTGYTPLHHAALNGHKDVVEVLLRNDALTNVADSKGCYPLHLAA
WKGDAQIVRLLIHQGPSHTRVNEQNALEIKELKKYGPFDPYINAKNNDNETALHCAAQYG
HTEVVKVLLEELTDPTMRNNKFETPLDLAALYGRLEVVKMLLNAHPNLLSCNTKKHTPLH
LAARNGHKAVVQVLLDAGMDSNYQTEMGSALHEAALFGKTDVVQILLAAGTDVNIKDNHG
LTALDTVRELPSQKSQQIAALIEDHMTGKRSTKEVDKTPPPQPPLISSMDSISQKSQGDV
EKAVTELIIDFDANAEEEGPYEALYNAISCHSLDSMASGRSSDQDSTNKEAEAAGVKPAG
VRPVCDPGLTPPGGCVGWLAPEQKRVDALPFLLVYVVAVLSSWPLVINYGKGPVYLLVNT
MDX

>gi568815592r:34778611_34987957|GENSCAN_predicted_CDS_3|1989_bp
atgtggggtattggttcgaggacccccatctataacaaaatccttgtatactcaagtcct
gaggtcagctgtgagaaacccaagtatgtgaaaagtcagacctctacacacacaggtttt
gaatcccacaaacgctatattttttatctgcgtttggttggaaaaaatctgcatgtcagt
gccatgggcttggggaagagccaggagagaaggaagaaggctttgaagagccagccccag
cagcctgctgacagttttccttctctccccactgtggaaggactgactaccagtggagag
gtggcagcaggcttttggctaagtggagaggctctctcttcctgtgtagtttgtggagag
gcagcgttgactcaagtctctgcccatagaatggctttcctcaagtcctatgtggcagat
gacatcaagggtgaggggaaggagagtctggttgggccctggctgaggagaaaagctctt
ttttcatttgctgctgagcaccagagccagtccctgattgggggcttgagcttgcttcct
gtcatcagttacagaatattagatagtgtgcaactttggggcttggctgtggggaatgcc
aagaatagttacaagataagcagtggatttaaaaacagagtgtattttgcactgtctgat
gagtacattgtgtctccgaagttgagattccacatagtgagccttggaaaacaactgctg
ctctgcgaccctggcatgtggagagggccaaatgtgaactgtgttgacagcactggctac
acacccctgcaccatgctgctttgaatggccataaggatgtggtcgaggttcttctgagg
aacgatgcgctgaccaacgtggctgactcaaaaggctgctaccctctgcatttggcagcc
tggaaaggagatgcccagatagtgcggttgctcatccatcaagggccttcacacaccaga
gtcaatgaacagaatgctcttgagatcaaagaactcaaaaagtacggcccctttgaccct
tatatcaatgccaagaacaatgacaacgagacagccctgcattgtgcagcgcagtatggc
cacacagaggtggtgaaggtgctcttagaggagctgacggaccccaccatgcgcaacaac
aaattcgagacccctttggacctggcagcactgtacgggcgactggaggtggtgaaaatg
ctccttaatgcacaccccaacctcctgagctgcaacactaagaagcacacccctctgcac
ttggcagcaaggaatggccacaaagccgtggtccaggtcctcctcgatgctggcatggac
agcaactaccagacggagatgggcagtgctttgcatgaggctgctttgtttggcaagacc
gatgtggtgcaaatcctgctggctgcaggaactgacgtcaacataaaagataaccatgga
ctgactgccctagacactgttcgggaactgccttctcaaaagagccagcaaatagcagca
ttaattgaagatcacatgactggaaaaagaagtacaaaagaagtagataaaaccccccca
ccccagccacctctcatctccagtatggactccatatcacagaagtctcagggtgacgtg
gagaaagcagtgactgaactgattatagattttgatgcaaatgctgaagaagagggtccc
tacgaagctctgtataatgccatctcctgccattcgttggacagcatggccagcgggcga
tcatctgaccaagactccacgaacaaggaggctgaggcagcaggagtgaaacctgctgga
gtgaggcctgtatgtgacccggggcttacacctcctgggggctgtgttggctggctggcc
cctgagcagaaacgggtggatgctttgccatttttacttgtgtatgttgtggctgtgctg
tcttcctggcccttggtaattaactatggcaaagggcctgtttacctcttggtaaacaca
atggatgnn