GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:12:32 Sequence gi568815597r:225398347_225667416 : 269070 bp : 41.63% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 174 320 147 2 0 117 84 74 0.997 9.41 1.02 Term + 708 923 216 2 0 63 36 139 0.958 2.36 1.03 PlyA + 1284 1289 6 1.05 2.19 PlyA - 3181 3176 6 1.05 2.18 Term - 5117 4957 161 0 2 79 38 112 0.885 2.42 2.17 Intr - 6360 6280 81 1 0 -1 113 91 0.518 1.39 2.16 Intr - 8486 8318 169 2 1 98 92 73 0.973 7.30 2.15 Intr - 12070 11945 126 1 0 119 76 44 0.986 6.26 2.14 Intr - 13094 12991 104 0 2 73 71 85 0.981 4.27 2.13 Intr - 14299 14108 192 2 0 88 93 98 0.618 8.84 2.12 Intr - 16986 16932 55 0 1 72 15 45 0.211 -6.77 2.11 Intr - 19834 19638 197 2 2 79 102 122 0.928 10.91 2.10 Intr - 21106 20917 190 1 1 60 70 182 0.978 11.84 2.09 Intr - 21452 21369 84 2 0 61 75 53 0.568 0.40 2.08 Intr - 23931 23731 201 0 0 109 40 158 0.963 11.66 2.07 Intr - 26642 26487 156 2 0 46 113 121 0.875 9.69 2.06 Intr - 30393 30360 34 2 1 30 94 68 0.000 -1.09 2.05 Intr - 44020 43894 127 2 1 54 41 163 0.033 7.02 2.04 Intr - 50239 50116 124 1 1 81 5 79 0.011 -1.86 2.03 Intr - 52332 52228 105 0 0 92 41 83 0.294 3.49 2.02 Intr - 53968 53786 183 1 0 82 99 68 0.246 6.36 2.01 Init - 58852 58742 111 1 0 103 80 21 0.153 3.06 2.00 Prom - 59018 58979 40 -8.05 3.26 PlyA - 59136 59131 6 1.05 3.25 Term - 62121 61865 257 1 2 44 41 186 0.767 4.26 3.24 Intr - 63012 62688 325 0 1 105 39 157 0.579 6.92 3.23 Intr - 63326 63195 132 2 0 45 99 143 0.869 11.02 3.22 Intr - 65801 65716 86 0 2 98 35 56 0.818 -0.08 3.21 Intr - 67019 66877 143 1 2 86 94 157 0.609 15.18 3.20 Intr - 67688 67331 358 0 1 92 47 151 0.433 4.79 3.19 Intr - 69772 69419 354 2 0 77 59 276 0.175 17.94 3.18 Intr - 73218 73047 172 0 1 106 56 32 0.222 0.39 3.17 Intr - 76981 76765 217 0 1 97 61 160 0.093 11.78 3.16 Intr - 82058 81917 142 0 1 50 70 123 0.004 5.19 3.15 Intr - 100058 100001 58 2 1 71 115 66 0.583 5.14 3.14 Intr - 114368 114311 58 1 1 80 78 88 0.757 4.97 3.13 Intr - 114670 114525 146 1 2 44 94 101 0.929 4.46 3.12 Intr - 116464 116250 215 2 2 122 66 178 0.850 16.51 3.11 Intr - 119644 118850 795 2 0 76 86 365 0.939 25.60 3.10 Intr - 121219 120852 368 0 2 107 99 617 0.999 58.56 3.09 Intr - 132292 132208 85 2 1 115 94 59 0.806 7.16 3.08 Intr - 137285 137161 125 1 2 -9 91 116 0.014 1.51 3.07 Intr - 149638 149500 139 2 1 105 41 33 0.011 -1.10 3.06 Intr - 156737 156560 178 2 1 80 110 159 0.988 15.87 3.05 Intr - 169068 168903 166 2 1 61 110 88 0.669 7.34 3.04 Intr - 246797 246630 168 1 0 45 81 107 0.007 3.84 3.03 Intr - 250702 250677 26 1 2 127 33 -16 0.016 -7.29 3.02 Intr - 254553 254340 214 2 1 96 78 106 0.849 8.20 3.01 Intr - 255814 255090 725 1 2 63 26 362 0.930 16.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 30175 30477 303 0 0 52 43 223 0.919 9.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:225398347_225667416|GENSCAN_predicted_peptide_1|120_aa GSASSHTGVYIFGLFIEGARWNREQKILEDSLPLEMCCDFPDIYFLPTKISTKTPNASNQ TDSELYAFECPVYQTPERSRILATTGLPTNFLTSVYLSTKKPPSHWITMRVALLCEKNEK >gi568815597r:225398347_225667416|GENSCAN_predicted_CDS_1|363_bp ggctcagcttcctctcacactggagtttacatttttggtttattcatcgagggggcaaga tggaatcgtgaacagaaaatactggaagactcgctgcctctggagatgtgctgtgatttt cccgacatatactttttgccaacaaagatttctaccaaaacaccaaatgcttccaaccag acagattcagaactctatgcttttgaatgcccagtttaccagacacctgagaggtcaaga attttggcaactaccggtttaccaacaaactttttaacatcagtgtatttatcaacgaag aaacctcctagtcactggatcacaatgcgggttgcattgctttgtgagaagaatgaaaaa taa >gi568815597r:225398347_225667416|GENSCAN_predicted_peptide_2|799_aa MEGPRDHCVSEINQTQNYKCPLSHMWELKKLISCLVQPLAISTPLPFPPDGEAPGQRLGF FVAASPPPLLAPASANSCGARVPQERGLCTTAERSPAQEAPGLRQQDDELAELRTGCLET GTAAFIRRNSCSRSGHNLKTNWTEHPSPKISARSHDGGSWEGKRCEDFCDSKTLGSPHRA LASSRVLCAGVLESIMVEGVPPSAEEKALIPCVNRSSISGNNRAPPRSLVHVVLGKVGSP GFVVWKCDSAGQPMERHYCLAAGSTQLSIDISKRVADRPHPLTSFRQRKGGSTSSSPSRR RGSRSRSRSRSPGRPPKSARRSASASHQADIKEARREVEVKLTPLILKPFGNSISRYNGE PEHIERNDAPHKNTQEKFSLSQESSYIATQYSLRPRREEVKLKEIDSKEEKYVAKELAVR TFEVTPIRAKDLEFGGVPGVFLIMFGLPVFLFLLLLMCKQKDPSLLNFPPPLPALYELWE TRVFGVYLLWFLIQVLFYLLPIGKVVEGTPLIDGRRLKYRLNGFYAFILTSAVIGTSLFQ GVEFHYVYSHFLQFALAATVFCVVLSVYLYMRSLKAPRNDLSPASSGNAVYDFFIGRELN PRIGTFDLKYFCELRPGLIGWVVINLVMLLAEMKIQDRAVPSLAMILVNSFQLLYVVDAL WNEEALLTTMDIIHDGFGFMLAFGDLVWVPFIYSFQAFYLVSHPNEVSWPMASLIIVLKL CGYVIFRGANSQKNAFRKNPSDPKLARFNHILPYFYIIYFTMLLVHREARDEYHCKKKYG VAWEKYCQRVPYRIFPYIY >gi568815597r:225398347_225667416|GENSCAN_predicted_CDS_2|2400_bp atggagggacctagagatcattgtgttagtgaaataaaccagacacagaactacaaatgt cctctcagtcatatgtgggagctaaagaagttgatctcatgtctcgtgcagcccttagcc atatctacaccactgcctttcccaccagacggtgaggccccaggacagagactggggttt tttgttgctgcttcaccgcccccgctgctggcccctgcaagcgccaattcttgtggtgca cgagttccccaggagaggggcctctgcactactgccgaacggagcccagcacaggaagcc ccagggctgcgacagcaggacgatgagctggcagagcttcgcacaggctgcctggaaact ggcacagctgcattcatcagaagaaacagttgcagccggagtggacacaacctgaaaacc aactggactgagcatccttctcctaaaatctcagccagaagccacgatggagggtcctgg gaagggaagagatgtgaagatttctgtgattctaaaaccttgggaagccctcacagggct ctggcatccagcagggtgctctgtgcaggtgttttggagagcatcatggtggagggtgtt cctccttctgcagaggagaaggcactcattccctgtgtcaaccggagcagcatctcgggt aacaaccgagcccctcctcgcagcctagttcatgtggtactggggaaggtcggcagccct ggttttgttgtgtggaaatgtgattccgctggtcagccgatggagcggcattactgtctg gcagctggaagtacccagctatcaattgatatttcaaaaagggtggcagacaggcctcac cctttaacttcctttaggcaaaggaaaggtggctcaacttccagttccccttccagacgc cgagggagtcgatcaaggtcacgctcccgatcccctggtcgaccacctaaaagtgcccgc cgatctgcttctgcttcccaccaggccgacattaaggaagcaaggagggaagtggaagtt aaattgactccgctgattctgaagccatttggaaatagcatcagcagatataatggggag cctgagcatattgagagaaatgacgcacctcataaaaatacacaggaaaaattcagtttg tcacaagaaagcagttacatagcaacacagtatagccttcgtccaagaagagaagaagtc aaattaaaagaaatagattctaaggaagaaaaatacgttgcaaaagaactggcagtgaga acctttgaagtgacccccatccgggcaaaggacttggagtttggaggagtacctggtgtg tttctcatcatgtttggcctgcctgtgttcctcttcctgttgctgttgatgtgtaaacag aaagatcccagtcttctgaatttccctcctcctttgccagctttgtatgagttatgggaa accagagtatttggggtctacctcctgtggtttttgattcaagtcctgttctacctactg ccaattggaaaggttgtagaaggaacgcctcttattgatggaagaagactcaagtataga ttaaatggattctatgcttttatcctgacatctgcagtcatcggaacatctctcttccag ggcgtagagtttcattacgtgtacagtcattttcttcagtttgcacttgcggccactgtt ttttgtgtggtcttgagtgtgtatctctacatgcgctctttgaaagcgccccggaatgac ctgtcgcctgccagctctggaaatgctgtctatgatttcttcattggccgtgaattaaac cctcgaattggtacttttgatctcaaatacttttgtgaattgcgccccggattgattgga tgggtggttattaacttggtgatgcttttggctgaaatgaaaatacaggaccgcgctgtt ccatccttggccatgattttagttaatagtttccagcttctctatgtggtggatgctctc tggaatgaggaagcgttgttgacgaccatggacatcatccacgatggatttggattcatg ctggcttttggagacttggtgtgggttccctttatttacagcttccaagccttttattta gtcagtcatccaaatgaagtgtcttggccaatggcttctctaattattgttctgaaactt tgtggttatgtaatcttccgaggtgcaaattctcagaaaaatgcattccggaaaaatccc agtgatccaaagcttgcacgttttaaccacattctgccttatttctacataatttatttc accatgttgcttgtccaccgagaagctcgtgacgagtaccactgtaagaagaaatacggc gtggcttgggaaaagtactgtcagcgtgtgccctaccgtatatttccatacatctactaa >gi568815597r:225398347_225667416|GENSCAN_predicted_peptide_3|1883_aa GGDFKTIGNDEKFKIIISSSLSPPPARDSLFANDAGKRLHWLRFSSSTLAATHCSARVSR RLRFPESRLRSFGDEVIPIPSVHINSFHRRSTLGFLGKKRSEGNNDDHIFHLYAPPQLRR NRWAQPRRGRAVGLTGMPPDSLIPVLRPQQGGLSTRPARGHPAASRPGVSWPGPVPSHAS KLPTPVPTGLAPRRSRRLGLRTVAVGEASELPKPLGSGRRRGGGRRSRTTTGEEPRPTTG RSLSLALFSHTPGRRLGFVRGDGVAPWPPKEEPDTCLPSPSCSPPLEERPPPRLGAFCVS RLVGKPLRRRHHEAYFYVRLFGREIEDRTTGHFSVGKQARGNAEVETLVVVALKGLNKFW SCIFVLRRQPAISLVGLFEQSICQARAAVMVYDDANKKWVPAGGSTGFSRVHIYHHTGNN TFRVVGRKIQDHQVVINCAIPKGLKYNQATQTFHQWRDARQVYGLNFGSKEDANVFASAM MHALEVLNSQETGEVRHNHNTQEKPCLRTDNILYWVCEGDEPPSHWSSREHISDKLTFSL CEVLQMAPLSFEASNTKVSVAAQSKVTATQDSTNLRCIFCGPTLPRQNSQLPAQVQNGPS QEELEIQRRQLQEQQRQKELERERLERERMERERLERERLERERLERERLEQEQLERERQ ERERQERLERQERLERQERLERQERLDRERQERQERERLERLERERQERERQEQLEREQL EWERERRISSAAPSSDSSLYNAPLPEYSSCQPPSAPPPSYAKVISAPVSDATPDYAVVTA LPPTSTPPTPPLRHSATRFATSLGSAFHPVLPHYATVPRPLNKNSRPSSPVNTPSSQPPA TKPCAWSTSNFSPLPPSPPIMISSPPGKATGPRPVLPVCVSSPVPQMPPSPTAPNGLVDS VTYPVSPPPTSGPAAPPPPPPLPSLASLSHCGSQASPPPSTPIASTPSSKPSVLPSPSAA APASVETPLNSVLGDSSASEPGLQAASQPAETPSQQGPPPPPPLPSTGPPPPPPPPPLPN QVPPPPPPPPAPPLPASGFFLASMSEDNRPLTGLAAAIAGAKLRKVSRMEDTSFPSGGNA IGVNSASSKTDTGRGNGPLPLGGSGLMEEMSALLARRRRIAEKGSTIETEQKEDKGDILD EMRKELTKLKEELIDARSSGCGYEAGTPGPHLHPKARLGKAEFWKATLMCINASVLNLGS MVCRAGVGICRQPLGSSQGSGESVEQGKAARPKAAGVQVRWAPKASRVLPGPRAWTVALE RPLAVMLGSAPRPTRAQGSCPAQGGDRPLTGVDRRTGCRMTTCALSVQSGVTSEAPFPST PSSYLSLLCPEAAPSGLREPLELQRDADSAGPSLPAPSSPRASTWAGGGGGENQGAGREV KRHRRGAPCSVLHPRHREHRLLRSLAWRCPAATHPPSEGCSQHREREARQEVTSISPATE RAAPPASAAGWAQRCPHGVLLGGQTVPRDPFRALEMGELCGSSRAHATGCGQSGCLRRRQ AFPIGGPQAQGPLPAGPRATQVIPLWSGCGSWFLLAWAAPEAQRISSQAHPSFLLWEPGG GAQSPVCPSRSHFLSHPAPNVRGTGPTNLHPAASSTWALLSDGSVTKHPDRRAPASLALP PLQPVRSDLLVHLSEWLSGETLSAAGAGSGGCGIVWAEELGRLPEVSNGPSGPREQSTQP QEIRERRQGQTLVNISKPGWRGTVDEDEVGSRSSKSFQSHLILPASSRQEQQAPCAVLHQ TGKDSWTRREEAAGEMKLPPQPRQDSSQGCLSASCAGKTPSPPSPAGRGEPHLLLLILSC REDGPGSPQTTDTNRGPGWGGRYQRQLPRTLGDAPSAPPFATRGLKKDFLPWENPDRPPC HLFFTLIRSSPWDNLTVPVQLCG >gi568815597r:225398347_225667416|GENSCAN_predicted_CDS_3|5652_bp ggaggtgacttcaagaccattgggaatgacgaaaaatttaaaattatcattagttcctct ctctccccgccccctgcccgggattccctctttgctaatgacgctggaaagcggctccac tggctgcgtttctcttcttccactcttgcggcaacacactgttccgcgcgcgtctcccgg cgactgcgtttccccgagagtcggctgcgttcgtttggggatgaagtcatccctatacct agtgtgcacatcaactcgttccatcggcgcagcacgctgggattcctcgggaagaaaagg agcgaagggaacaatgatgaccatattttccatttatatgcccctcctcaactccgcagg aatcgctgggcgcagccgcggcgagggcgggcagttgggctgaccgggatgccccccgat tccctcatccccgtcctgagaccgcaacagggcgggctgagcactcgccccgcgcgaggg cacccggccgcgagccgacccggagtctcctggcccggccctgtgcccagtcacgcgtcc aagctccccacgccggtgcctacgggtctggcgcccaggaggtctcggcgcctcggtctc cgcaccgtggccgtgggggaggcctcggagctccccaagcctctggggagcgggaggcga cggggagggggccgacggtcaaggacaaccacgggcgaggaaccccggccgacaactggc cggagcctttctctcgccctcttctcccacactcccggccggcgcctcggctttgtgcga ggagatggtgtagccccctggccgccgaaggaggagccggacacttgtctcccgtctccg agctgctccccacccctggaggagagacccccccctcggctcggcgccttctgcgtctcc cggctggtggggaagcctctgcgccgccggcaccatgaggcctatttctatgttagacta tttggaagagagattgaagatagaaccactggacatttctctgtagggaaacaagcacgt gggaatgctgaggtggaaaccttggtagtggtggctttgaaaggcctgaacaaattttgg agctgtatcttcgtgctaaggagacagccagccatctctttggtaggcctgtttgaacag agtatctgtcaggcaagagctgctgtgatggtttatgatgatgccaataagaagtgggtg ccagctggtggctcaactggattcagcagagttcatatctatcaccatacaggcaacaac acattcagagtggtgggcaggaagattcaggaccatcaggtcgtgataaactgtgccatt cctaaagggttgaagtacaatcaagctacacagaccttccaccagtggcgagatgctaga caggtgtatggtctcaactttggcagcaaagaggatgccaatgtcttcgcaagtgccatg atgcatgccttagaagtgttaaattcacaggaaacaggggaggtaaggcacaaccataac acacaagagaaaccctgtctgaggacagataacatactgtattgggtgtgtgaaggggac gagccgccttctcactggagcagccgggaacatattagtgacaagctgactttcagttta tgtgaagttcttcagatggccccgttgtccttcgaagcttctaacaccaaggtgtctgtt gcagcccagagcaaggttactgctacccaggacagcactaatttgcgatgtattttctgt gggccaacattgcctagacaaaactcacaactacctgctcaagttcaaaatggcccatcc caagaagaattggaaattcaaagaagacaactacaagaacagcaacggcaaaaggagctg gagcgggaaaggctggagcgagaaagaatggaaagagaaaggttggagagagagaggtta gaaagggaaaggctggagagggagcgactggaacaagaacagctggagagagagagacaa gaacgggaacggcaggaacgcctggagcggcaggaacgcctggagcggcaggaacgcctg gagcggcaggaacgcctggatcgggagaggcaagaaagacaagaacgagagaggctggag agactggaacgggagaggcaagaaagggagcgacaagagcagttagaaagggaacagctg gaatgggagagagagcgcagaatatcaagtgctgctccatcttcagacagctccctgtat aacgctccacttcctgagtattccagttgccagcctccttcagcacctcctccatcatac gctaaagtcatctcagctccagtgtcagatgccactcctgattatgctgtagtgactgct ttgccacctacttccacaccccctacaccaccactgcgacactcagcgacacgttttgca acatctttaggttcagccttccaccctgttcttccccattacgctacagttcctcgtcct ctgaacaaaaactctcgaccttcttctcctgtgaacacaccctcttctcagcctccagct acgaagccctgtgcctggtctacttccaatttttcgcccctccctccatctcctccaata atgattagcagccccccaggcaaagctactggtccaaggcctgtcctccctgtttgtgtt tcttctcctgtgccccaaatgcctccatcaccgacagcacccaatgggctggttgactct gtaacatatccagtgtctccaccgcctacctcagggccagcagctcctccgccgccgcct ccactgccttccctcgcatcactctcacactgtggatctcaagcttctcctcctccaagc acccctattgcctcaactccctcatccaagcctagtgttctcccttctccctctgcagct gcccctgcctctgttgagactcctctaaactctgtgctgggagactcttctgcttctgag ccaggcttgcaggcagcctctcagccggccgagactccatcccaacaggggccccctcca cctcctccactcccatccaccgggcctccaccgccccctcctccccctcctctccctaat caagtaccccctcctcctccaccacctcctgccccacccctccctgcatctggattcttt ttggcatccatgtcagaagacaatcgccctttaactggacttgcagctgcaattgccgga gcaaaacttaggaaagtgtcacggatggaggatacctctttcccaagtggagggaatgct attggtgtgaactccgcctcatctaaaacagatacaggccgtggaaatggaccccttcct ttagggggtagtggtttaatggaagaaatgagtgccctgctggccaggaggagaagaatt gctgaaaagggatcaacaatagaaacagaacaaaaagaggacaaaggtgacattttagat gaaatgagaaaagaattaacaaagctaaaagaagagctcattgatgcaaggagcagtgga tgtggctatgaggcaggcaccccagggcctcatcttcaccccaaagcaaggctagggaaa gcggagttctggaaagccacactgatgtgcatcaacgcctctgtgctaaaccttggatcc atggtctgccgggctggtgtggggatttgcaggcaaccactcggatcctcccaagggagc ggggagagtgtggagcagggaaaggcggcacgtcccaaggccgcaggcgtgcaggtccgc tgggcgcctaaggccagccgggttctgccggggcccagggcctggacggtggcgctcgag cgacccctggcggtcatgctcggcagcgcgccccgtccaacgcgggcccaggggtcttgc cctgcccagggaggggaccgcccacttactggtgttgacaggaggactgggtgcaggatg accacctgtgcactctcggttcagtcgggggttacctctgaagctcccttcccctctacg ccttcctcctacctctcactactctgtcctgaggcagctcccagtggcctccgagagccc cttgagctgcagagagatgcggactctgcaggcccctcactgccagcaccttcatcaccg agagcctccacctgggctggtggtgggggtggagagaatcaaggagcaggaagagaggtg aagcgacaccgccgaggagccccctgctctgttcttcatcctagacacagggagcaccgg cttctccgcagccttgcctggcgctgccctgcagccacccaccccccttctgaaggatgc agccagcacagagagagggaagcgagacaggaagtgacctcaatttccccggccacagaa agggcggctccccctgcctctgcagctgggtgggctcagagatgtccccatggggtcctc ctgggtggccagactgtgcccagggaccccttcagggctcttgagatgggggagctctgt ggctcttccagggcccacgccacgggctgtgggcagtctggctgcctgcgcaggaggcag gcgtttcctatcggaggaccccaagcccaaggcccccttcctgctgggcctcgggcaact caagtgattcctctctggtctggctgtgggagttggttccttcttgcctgggcagcccct gaagcacagagaatcagctcccaagcccatccttccttcctactctgggagcctggtgga ggagcacaaagtcctgtctgtccttccagatcccacttcctgtcccacccagccccaaat gtccgtggaaccggccccactaacctgcatccagctgcctcttccacctgggccctgctc tcagatggaagtgtcaccaaacacccagatcgccgtgctcctgcttctctggcactgcct ccactgcagccagtgaggtctgatcttctcgttcacctctcagagtggctcagtggtgaa acgttgtctgcagcaggggcagggagtggaggttgcggcatcgtctgggcagaggagctg gggcgtttgccagaagtctccaacgggccatctggtcccagagagcagagcacacagcct caagaaatacgagaaaggcggcaggggcagactctggtaaacatcagcaaacctggatgg agaggaaccgtggatgaggatgaagtggggagtaggtcctccaaatccttccaaagccac ctcatccttccagctagctccaggcaggagcagcaagctccctgtgctgtcctgcatcaa acaggcaaggacagctggacacggagagaagaggctgcaggagaaatgaaattacctcca cagcccaggcaagacagcagccaggggtgcctatctgccagctgtgcaggaaagacccct tccccaccatcccccgctggtcggggagagccacacttgcttttacttatactcagctgc agggaggatggccccgggagcccccagaccacagacaccaacagaggcccggggtggggt gggcggtaccagaggcagctgccgaggactcttggagatgctccctctgctcctcctttt gccaccaggggcctgaaaaaggatttccttccgtgggaaaatccagatcggcctccctgt cacctcttcttcaccctcattcgttcctccccctgggataatctgactgtgcctgttcag ctctgtggctaa