GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:19:05 Sequence gi568815597f:40440583_40647457 : 206875 bp : 43.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9274 9555 282 0 0 82 44 214 0.532 13.45 1.02 Intr + 10074 10207 134 2 2 66 40 80 0.458 0.44 1.03 Intr + 10349 10506 158 2 2 -65 116 218 0.595 8.95 1.04 Intr + 13621 13706 86 2 2 109 94 63 0.785 8.54 1.05 Intr + 16363 16489 127 0 1 25 82 175 0.779 10.85 1.06 Intr + 16762 16857 96 2 0 53 111 32 0.553 1.98 1.07 Term + 21839 23007 1169 0 2 62 44 472 0.375 32.20 1.08 PlyA + 24035 24040 6 1.05 2.00 Prom + 37171 37210 40 0.24 2.01 Init + 38780 38906 127 1 1 86 100 132 0.986 13.73 2.02 Intr + 41181 41272 92 1 2 96 109 97 0.999 12.31 2.03 Intr + 48506 48632 127 1 1 56 86 113 0.901 8.15 2.04 Term + 54339 55477 1139 1 2 102 42 571 0.959 46.03 2.05 PlyA + 55603 55608 6 1.05 3.00 Prom + 64242 64281 40 -4.26 3.01 Init + 73963 75016 1054 0 1 72 59 573 0.015 47.17 3.02 Intr + 92561 92599 39 0 0 79 110 35 0.058 3.00 3.03 Intr + 93144 93226 83 1 2 57 39 48 0.036 -3.84 3.04 Intr + 100004 100130 127 1 1 60 82 76 0.504 4.45 3.05 Intr + 101033 101128 96 0 0 95 109 34 0.557 6.18 3.06 Term + 105980 106878 899 0 2 62 34 473 0.435 31.84 3.07 PlyA + 107034 107039 6 1.05 4.00 Prom + 107066 107105 40 -4.76 4.01 Init + 108843 108949 107 2 2 77 78 87 0.356 6.29 4.02 Intr + 110616 110643 28 0 1 63 73 24 0.159 -3.48 4.03 Intr + 117013 117237 225 0 0 49 39 174 0.235 6.78 4.04 Intr + 122867 123077 211 1 1 56 68 116 0.400 4.89 4.05 Term + 124801 124895 95 2 2 105 47 18 0.303 -2.61 4.06 PlyA + 125232 125237 6 1.05 5.02 PlyA - 126501 126496 6 1.05 5.01 Sngl - 153419 153141 279 2 0 33 42 235 0.295 8.83 5.00 Prom - 158518 158479 40 -0.46 6.11 PlyA - 158635 158630 6 1.05 6.10 Term - 165330 165231 100 2 1 55 36 74 0.425 -3.50 6.09 Intr - 169852 169729 124 2 1 101 75 87 0.841 8.34 6.08 Intr - 179837 179780 58 2 1 116 116 60 0.931 10.06 6.07 Intr - 185126 184976 151 1 1 69 99 22 0.841 1.56 6.06 Intr - 186147 186044 104 0 2 125 47 269 0.957 25.47 6.05 Intr - 188367 188228 140 1 2 109 94 236 0.978 26.48 6.04 Intr - 188790 188689 102 1 0 130 94 127 0.758 17.65 6.03 Intr - 192599 192487 113 1 2 142 92 43 0.997 10.12 6.02 Intr - 195475 195334 142 2 1 5 56 286 0.996 16.51 6.01 Init - 201343 201127 217 1 1 72 81 120 0.540 8.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 73963 75084 1122 0 0 72 42 579 0.922 48.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_1|683_aa MVVLHQSLGPKNAGSISSRRSQADDARRDRILGAPERSQPGAHRWGRLRWRSGSPGGRWE PCSSQPLRRRTGPRLPAAALRGSLAAQCRNPAAAHLQGLLRVNTWRLQLLAKETDGAQVG RYALRADDRHKSRSRILILGSSERSADAVMLQQLLITLPTEASTWVKLRHPKAATERVAL WEDVTKMFKAEALLSQDADETQGESLESRVTLGSLTAESQELLTFKDVSVDFTQEEWGQL APAHRNLYREVMLENYGNLVSVGCQLSKPGVISQLEKGEEPWLMERDISGVPSSDLKSKT KTKESALQNDISWEELHCGLMMERFTKGSSMYSTLGRISKCNKLESQQENQRMGKGQIPL MCKKTFTQERGQESNRFEKRINVKSEVMPGPIGLPRKRDRKYDTPGKRSRYNIDLVNHSR SYTKMKTFECNICEKIFKQLIHLTEHMRIHTGEKPFRCKECGKAFSQSSSLIPHQRIHTG EKPYECKECGKTFRHPSSLTQHVRIHTGEKPYECRVCEKAFSQSIGLIQHLRTHVREKPF TCKDCGKAFFQIRHLRQHEIIHTGVKPYICNVCSKTFSHSTYLTQHQRTHTGERPYKCKE CGKAFSQRIHLSIHQRVHTGVKPYECSHCGKAFRHDSSFAKHQRIHTGEKPYDCNECGKA FSCSSSLIRHCKTHLRNTFSNVV >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_1|2052_bp atggttgttcttcatcagagtctggggcccaagaacgcaggctccatctcctcccgcagg agccaggctgatgatgctcgcagggatcggatactgggagcccctgaacgcagccaaccc ggcgcgcaccggtgggggcgtctgcgctggcggagcggctccccgggaggacgctgggaa ccatgctctagccagccgctgcgcaggcgcactgggccccgactgcccgccgcagcgcta cgtgggagcttggccgcgcagtgccggaacccggctgcagcgcatctgcaggggctgctg agagtaaatacttggcgcctccagctgctggccaaggagacagatggagctcaagttggg agatacgccctgagagccgatgatagacacaagtccagatctcggattttgatactaggg tcctcagaaaggagtgcggacgccgtcatgctgcagcagctcctgatcaccctgcccacc gaggccagcacctgggtgaagttgcgtcatccaaaggcggccacggagcgggtggccctg tgggaggatgtgactaagatgtttaaagcagaagctctgctgtctcaggatgctgatgag acccagggcgaaagtttagagagtagagtgacccttggatccctgacagcagaatcccag gaactgttaaccttcaaggacgtatctgtggacttcactcaggaggagtgggggcagctg gcccctgctcaccggaatctgtaccgggaggtgatgctggagaactatgggaacctggtc tcagtgggatgtcagctttccaaacctggcgtgatttcccagttggagaaaggagaagaa ccatggctgatggagagagatatttcaggagttccaagttcagacttgaagagcaaaaca aaaaccaaagagtcagccttacagaatgatatttcgtgggaagaactacattgtggccta atgatggaaagatttacaaaaggaagcagcatgtattccaccttgggaagaatctccaaa tgtaataagctagaaagccaacaagagaaccaaagaatgggtaaggggcaaatccccctg atgtgcaagaaaacattcactcaggagagaggccaagagtctaatagatttgagaaaaga attaatgtgaagtcagaagttatgccaggaccaataggtcttccaagaaaaagagatcgt aaatatgacacacctggaaagagaagcagatacaacatagatttagttaatcattcaagg agttatacaaaaatgaaaacctttgagtgtaatatttgtgaaaaaatcttcaaacagctt attcaccttactgaacacatgagaattcataccggggagaaacctttcagatgtaaggaa tgtggaaaagcctttagccaaagttcatctcttattccgcatcagagaattcatactggt gagaaaccctatgaatgtaaggagtgtgggaaaaccttcagacatccttcatcgcttact caacatgttagaattcataccggggaaaagccctatgaatgtagggtatgtgagaaagcc ttcagccagagcattggactgatccagcatttgagaactcatgttagagagaaacctttt acatgcaaagactgtggaaaagcgtttttccagattagacaccttaggcaacatgagatt attcatactggtgtgaaaccctatatttgtaatgtatgtagtaaaaccttcagccatagt acatacctaactcaacaccagagaactcatactggagaaagaccatataaatgtaaggaa tgtgggaaagcctttagccagagaatacatctttctatccatcagagagtccatactgga gtaaaaccttatgaatgcagtcattgtgggaaagcctttaggcatgattcatcctttgct aaacatcagagaattcatactggagaaaaaccttatgattgtaatgagtgtggaaaagcc ttcagctgtagttcatcccttattagacactgcaaaacacatttaagaaataccttcagc aatgttgtgtga >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_2|494_aa MPQQLLITLPTEASTWVKLQHPKKAVEGAPLWEDVTKMFEGEALLSQDAEDVKTQRESLE DEVTPGLPTAESQELLTFKDISIDFTQEEWGQLAPAHQNLYREVMLENYSNLVSVDLKSK IETIESTAKSTISQERLYHGIMMESFMRDDIIYSTLRKVSTYDDVLERHQETCMRDVRQA ILTHKKRVQETNKFGENIIVHSNVIIEQRHHKYDTPTKRNTYKLDLINHPTSYIRTKTYE CNICEKIFKQPIHLTEHMRIHTGEKPFRCKECGRAFSQSASLSTHQRIHTGEKPFECEEC GKAFRHRSSLNQHHRTHTGEKPYVCDKCQKAFSQNISLVQHLRTHSGEKPFTCNECGKTF RQIRHLSEHIRIHTGEKPYACTACCKTFSHRAYLTHHQRIHTGERPYKCKECGKAFRQRI HLSNHKTVHTGVKAYECNRCGKAYRHDSSFKKHQRHHTGEKPYECNECGKAFSYNSSLSR HHEIHRRNAFRNKV >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_2|1485_bp atgccacagcagctcctgatcaccctgcctaccgaggccagcacctgggtgaagctgcaa catccaaagaaggccgtggagggggcgcccctgtgggaggatgtgactaaaatgtttgaa ggagaagctctgctgtctcaggatgctgaggacgtaaagacccagagagaaagtttagag gatgaagtgacccctggactcccgacagcagaatcccaggaattgttgactttcaaggac atatctattgacttcacccaggaagagtgggggcagctggctcctgctcaccagaatcta taccgagaggtgatgctggagaactacagcaacttggtgtcagtggacttgaagagtaaa atagaaaccattgagtcaactgcaaagagtaccatttcacaggagcgcttatatcatggc attatgatggaaagtttcatgagggatgatataatttattccacgttgagaaaagtctcc acatatgatgatgtcttagaaaggcaccaggaaacttgtatgagagatgtgagacaagcc atcttgacccataagaagagagtccaagaaactaacaaatttggggaaaatatcattgtg cattcaaatgttattattgaacagaggcaccataaatatgatacacctacaaagcggaac acatacaaattagatttgattaatcatccaacaagttacataagaacaaaaacctatgaa tgtaatatatgtgaaaaaatcttcaaacaacctattcaccttactgaacatatgagaatt catactggtgagaaacctttcagatgtaaggaatgtggaagggcctttagtcaaagtgca tccctcagtacacaccagagaatccatactggtgagaaaccctttgaatgtgaggaatgt gggaaagccttcagacatcgctcatcacttaatcagcatcatagaactcacactggggag aaaccctatgtatgtgataaatgtcagaaagctttcagccagaacattagcttggttcaa catttgaggactcattctggagagaaaccttttacttgcaatgaatgtgggaaaaccttt agacagattagacaccttagtgaacatataagaattcataccggggagaagccctatgca tgcactgcatgttgtaaaacctttagtcatagagcgtatctaacacatcaccagagaatc catactggggagagaccctacaaatgtaaagaatgtggaaaagcctttaggcagaggata caccttagcaaccataaaactgttcatacaggagtgaaagcatatgaatgcaaccgctgt ggaaaagcctataggcatgattcatcctttaaaaaacatcagagacatcacactggagaa aaaccttacgaatgtaacgaatgtggaaaagccttcagctataactcttcacttagtcga catcatgaaatacacaggaggaacgccttccgaaataaggtgtaa >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_3|765_aa MAETREEETVSAEASGFSDLSDSEFLEFLDLEDAQESKALVNMPGPSSESLGKDDKPISL QNWKRGLDILSPMERFHLKYLYVTDLATQNWCELQTAYGKELPGFLAPEKAAVLDTGASI HLARELELHDLVTVPVTTKEDAWAIKFLNILLLIPTLQSEGHIREFPVFGEGEGVLLVGV IDELHYTAKGELELAELKTRRRPMLPLEAQKKKDCFQVSLYKYIFDAMVQGKVTPASLIH HTKLCLEKPLGPSVLRHAQQGGFSVKSLGDLMELVFLSLTLSDLPVIDILKIEYIHQETA TVLGTEIVAFKEKEVRAKVQHYMAYWMGHREPQGVDVEEAWKCRTCTYADIYFCVRAAEN DQLPVSSYPEHLLFYMNFKVIKITNIVKVHKAESVTFQDVAVDFTAEEWQLLDCAERTLY WDVMLENYRNLISVGCPITKTKVILKVEQGQEPWMVEGANPHESSPESDYPLVDEPGKHR ESKDNFLKSVLLTFNKILTMERIHHYNMSTSLNPMRKKSYKSFEKCLPPNLDLLKYNRSY TVENAYECSECGKAFKKKFHFIRHEKNHTRKKPFECNDCGKAYSRKAHLATHQKIHNGER PFVCNDCGKAFMHKAQLVVHQRLHTGEKPYECSQCGKTFTWNSSFNQHVKSHTLEKSFEC KECGKTFRYSSSLYKHSRFHTGEKPYQCIICGKAFGNTSVLVTHQRIHTGEKPYSCIECG KAFIKKSHLLRHQITHTGEKPYECNRCGKAFSQKSNLIVHQKIHT >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_3|2298_bp atggcagagacaagagaagaggagacagtgtcagcagaagcctcagggttctcagacttg agtgactcagagttcctggagtttctggacctagaagatgcccaagagtcaaaggcttta gttaacatgcctggcccatcttctgaatcccttgggaaggatgacaaacccataagctta caaaactggaaaagaggattggatatattatcacccatggagagattccaccttaaatat ttatatgtcactgacctggctactcagaactggtgtgaactgcaaacagcatatgggaag gagcttcctggtttcttggcacctgagaaggcagctgtgttggacactggtgccagcata cacctagctagagaactagaacttcatgatcttgtgactgtcccagtcaccactaaagaa gatgcttgggcaattaagtttctgaacatacttttgctgattcctaccctgcagtcagaa gggcacatcagagagtttccagtgtttggggaaggggagggtgtacttcttgttggagtg attgatgagctgcactatacagccaagggggaactggagctggcggaactcaagacacgc aggcgccctatgctccctctggaagctcagaagaagaaagactgttttcaagtcagccta tacaaatatatctttgatgccatggtacaaggaaaagtgacccctgctagcctaatccac cacacaaagttgtgtctagaaaagccactggggccatcagtgctgaggcatgcccagcag ggaggcttctctgtgaagtctttgggtgacctcatggaacttgtcttcttgtctctaaca ctgtcagacctcccagttattgatatcttgaagattgagtatatccaccaagagactgcc actgtgctgggtactgagattgtagccttcaaagagaaggaggtgagagccaaggtgcag cattatatggcctactggatgggccaccgagagccccagggagttgacgtggaggaggct tggaagtgccggacgtgtacctatgcagacatttatttctgtgtaagagctgcagaaaat gatcagcttccagtctctagctatcctgagcatttactcttctacatgaattttaaggtc atcaaaataacaaatattgtcaaggttcataaagcagaatcagtgacattccaggatgtg gctgtggatttcactgcagaggagtggcagctgcttgattgtgctgagagaaccctgtat tgggatgtgatgttggagaactatagaaacctcatctcagtgggatgtccaattaccaaa acaaaagtgatcctcaaggtagagcaaggacaagagccatggatggtggagggagcgaat ccacacgagagctctccagaatctgactacccacttgttgatgaaccagggaagcatcgg gaaagcaaagacaattttttgaagtcagttttgctcacattcaataaaattctgactatg gagagaatccaccattataatatgagcacaagtcttaatccaatgagaaaaaaatcatat aaatcgtttgagaagtgtttgccacctaatttagacttacttaaatataatagaagttat actgtagaaaacgcttatgaatgcagtgaatgcgggaaagccttcaaaaagaagtttcat ttcattagacatgaaaaaaatcatacaaggaaaaaaccttttgaatgcaatgactgtgga aaagcctatagcaggaaggcacaccttgcaactcatcagaaaattcataatggagagaga ccctttgtgtgcaatgattgtgggaaggcgtttatgcataaagcccaactcgtggtccac cagagacttcacactggagagaagccttatgagtgcagtcaatgtgggaaaacattcact tggaactcctcatttaatcaacacgtgaaatctcatacacttgagaagtcatttgaatgt aaggaatgtgggaaaaccttcaggtatagttcatccctttataaacattccagatttcat acaggagagaaaccctaccagtgtatcatatgtggcaaagcttttggcaacacatccgtg cttgttacacaccaaagaattcatacaggagagaaaccttacagttgtattgaatgtggc aaagccttcatcaagaagtcccatctcctcagacatcagataactcatacaggagagaag ccctatgaatgtaacagatgtgggaaagcattttcccagaagtcaaatcttattgtacat cagaaaattcatacataa >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_4|221_aa MPGIFRMNSIGIINEDENLLEKSKFFCLATLEYDILMGMTIIIYRASRVKLQTFTVSVTA LKAARPELFIPPVGVGVSLTSGEKQQTFVVSVTAHKGTVEPKSVQQQDLLQTAKEQSIHK CCQVDPVYHMHCCRCLLTEEEFLFIGVLGTCYIQGCGHGLHQRGMGATRLCTKGLYREVM LEIYGNLVIVGTWMKLETIILSRLSQGQKTKHRMFSLIGGN >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_4|666_bp atgcctggaattttccgtatgaacagcataggcatcattaatgaagatgaaaacttgtta gaaaagagcaagtttttctgcttagcaactttggaatatgacattttaatgggaatgaca attatcatctatagggcttcaagagtgaagctgcagaccttcacagtgagtgtgacagct cttaaggcagcgcgtccggagttgttcattcctcctgttggagtcggcgtctcgctaacc tcaggagagaagcagcagaccttcgtggtgagtgttacagctcataaaggcactgtggag ccaaagagtgtgcagcagcaagatttattgcaaacagcaaaagaacaaagcatccacaag tgttgccaggtggaccctgtgtaccacatgcattgctgtagatgcttgctcaccgaagag gaattcctgttcattggtgttttaggaacctgttacattcaaggatgtggccatggactt caccaaagaggaatgggggcaactagactatgcacaaagggcctttacagagaagtgatg ctggagatctatggcaacctggtcatagtggggacatggatgaagctggaaaccatcatt ctcagcagactatcgcaaggacaaaaaaccaaacaccgcatgttctcactcataggtggg aattga >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_5|92_aa MRLKRMQTQESAKPAKTLQKPDKVVTAGYKPVANHQCNIAYEKKKKEEGKRVQADKDHVL SLLFAAFEKHQCYNIKDVVGITMQPVCASRKS >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_5|279_bp atgagactgaagagaatgcaaacacaggaatctgccaaacctgccaaaaccctgcagaaa ccggacaaggttgtcacagctggttataaacctgttgctaatcatcagtgtaatattgca tatgagaagaaaaagaaagaagagggaaagagggtgcaggctgataaagaccacgtttta tccttgctatttgctgccttcgagaaacaccagtgttacaacataaaggatgtggttggc atcacaatgcaacctgtgtgtgcctcaaggaaatcttga >gi568815597f:40440583_40647457|GENSCAN_predicted_peptide_6|416_aa MFNGEPGPASSGASRNVVRSSSISGEICGSQQAGGGAGTTTAKKRRSSLGAKMVAIVGLT QWSKSTLQLPQPEGATKKLRSNIRRSTETGIAVEMRSRVTRQGSRESTDGSTNSNSSDGT FIFPTTRLGAESQFSDFLDGLGPAQIVGRQTLATPPMGDVHIAIMDRSGQLEVEVIEARG LTPKPGSKSLPATYIKVYLLENGACLAKKKTKMTKKTCDPLYQQALLFDEGPQGKVLQVI VWGDYGRMDHKCFMGMAQIMLDELDLSAAVTGCLPTPPCRRSSASGPGQAATEGMLSAKA NSPPEALRVALTPSGLIPGGHDSALKAFYFHRLHKGYPTTEEGGSVQVNQVELECMSGLE WLKLAGHSGTGRSKFHAGPMAASRAEEKGYSITKPLTLDHSLYFISVSYTEYYYAF >gi568815597f:40440583_40647457|GENSCAN_predicted_CDS_6|1251_bp atgtttaacggggagccaggtcctgcctcatctggggcctccaggaatgtggtgcggagc tccagcattagcggtgaaatctgcggatcccagcaagccgggggcggggctgggaccacc accgccaagaagcggcggagcagcctgggtgccaagatggtggccatcgtgggcctgact cagtggagcaagagcacactccagcttccgcagcctgaaggggccaccaagaagctgcgc agcaacatccgccggagcacggagacaggcatcgcggtggagatgcggagccgggtcaca cgccagggcagccgggagtccaccgatgggagcaccaacagcaacagctccgacggcacg ttcatcttccccactacccggctaggggctgaaagccagttcagcgatttcctggatggg ctgggaccagctcagattgtggggcgacagacactggcaacaccacccatgggagatgtg cacattgccatcatggaccggagtggccagctggaggtggaagtgattgaagctcggggc ctgacccccaaaccaggctccaaatccctcccagccacctatatcaaggtttacctgctg gagaatggggcctgcttggccaagaagaagacaaagatgaccaagaagacctgtgatccc ctgtaccagcaggctctgctctttgacgagggaccccagggcaaggtgctgcaggtgatc gtctggggagactatggccgcatggaccacaagtgcttcatgggcatggcccagatcatg ctggacgagctggacctcagcgccgcggtcaccggctgcctccccactccaccgtgccga aggagctctgccagtgggcctgggcaggcagccacagagggcatgttatctgctaaagca aacagtcctcctgaggccctgagggtggccctgaccccctcagggctcattcctggtggg catgactcggccctcaaagccttctacttccatagactccacaagggatatccaacgaca gaggaaggagggagcgtgcaagtgaaccaggtggaactggagtgcatgagtggactagag tggctgaaactggccggccactctggcaccggcaggagcaaattccatgcaggccccatg gcagcttccagggcagaagagaaaggctatagcattactaagcctctcaccttggaccac agcctctactttatcagtgtatcctatactgagtattactatgctttttag