GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:00:43 Sequence gi568815578f:44801534_45006435 : 204902 bp : 43.49% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 1566 1561 6 1.05 1.04 Term - 8418 8321 98 1 2 92 47 18 0.305 -3.77 1.03 Intr - 9098 8642 457 2 1 84 115 448 0.595 39.89 1.02 Intr - 10191 10104 88 2 1 73 42 45 0.303 -1.63 1.01 Init - 20687 20536 152 2 2 72 99 89 0.510 5.96 1.00 Prom - 26145 26106 40 -4.66 2.00 Prom + 31839 31878 40 -3.76 2.01 Init + 34927 35027 101 0 2 68 81 64 0.377 3.43 2.02 Intr + 36461 36621 161 2 2 59 51 94 0.090 2.43 2.03 Term + 55286 55500 215 0 2 -19 51 412 0.727 24.19 2.04 PlyA + 55703 55708 6 1.05 3.00 Prom + 62446 62485 40 -1.86 3.01 Init + 62956 62965 10 0 1 83 94 5 0.350 1.40 3.02 Intr + 84145 84353 209 2 2 -15 113 113 0.218 2.30 3.03 Intr + 84758 84827 70 1 1 122 52 8 0.198 -0.65 3.04 Intr + 86115 86209 95 1 2 42 99 51 0.315 1.28 3.05 Intr + 99998 100300 303 1 0 112 99 347 0.909 34.99 3.06 Intr + 102460 102583 124 0 1 62 106 86 0.989 7.96 3.07 Intr + 103435 103598 164 2 2 77 93 9 0.962 -0.01 3.08 Intr + 104468 104563 96 1 0 36 93 98 0.948 5.31 3.09 Term + 104849 104905 57 1 0 132 35 46 0.953 1.39 3.10 PlyA + 104972 104977 6 -0.45 4.00 Prom + 106162 106201 40 -1.76 4.01 Init + 108611 108803 193 1 1 100 101 392 0.903 40.83 4.02 Intr + 111127 111336 210 2 0 101 40 328 0.905 28.18 4.03 Intr + 115122 115338 217 1 1 65 65 266 0.467 19.56 4.04 Intr + 117373 117512 140 1 2 64 60 244 0.969 19.31 4.05 Intr + 117650 117744 95 0 2 88 97 133 0.993 13.88 4.06 Intr + 120061 120198 138 0 0 111 94 271 0.971 30.66 4.07 Intr + 122628 122723 96 2 0 49 100 146 0.926 12.11 4.08 Intr + 128927 129193 267 1 0 116 66 326 0.833 30.93 4.09 Intr + 130809 130899 91 2 1 96 76 -38 0.146 -4.63 4.10 Intr + 131539 131652 114 2 0 66 95 29 0.087 1.82 4.11 Intr + 133858 133964 107 2 2 102 90 15 0.114 3.03 4.12 Intr + 135104 135197 94 1 1 36 97 87 0.077 3.94 4.13 Intr + 136528 136658 131 2 2 16 121 139 0.886 10.41 4.14 Intr + 136966 137059 94 0 1 54 89 0 0.453 -3.76 4.15 Intr + 137141 137215 75 0 0 97 94 70 0.493 7.99 4.16 Intr + 137593 137708 116 2 2 113 31 5 0.435 -2.53 4.17 Term + 139173 139235 63 2 0 121 35 104 0.562 6.29 4.18 PlyA + 139859 139864 6 1.05 5.09 PlyA - 140625 140620 6 1.05 5.08 Term - 141680 141576 105 2 0 99 42 87 0.975 3.61 5.07 Intr - 142046 141920 127 1 1 98 93 114 0.998 13.58 5.06 Intr - 147344 147197 148 0 1 68 70 180 0.989 13.49 5.05 Intr - 150469 150300 170 0 2 100 83 104 0.999 10.69 5.04 Intr - 153687 153535 153 2 0 98 99 254 0.962 26.69 5.03 Intr - 154952 154853 100 0 1 91 88 76 0.999 7.07 5.02 Intr - 159096 158674 423 1 0 39 102 367 0.076 27.04 5.01 Init - 162674 162506 169 2 1 82 36 105 0.049 4.41 5.00 Prom - 163263 163224 40 -5.96 6.00 Prom + 164332 164371 40 -7.56 6.01 Init + 165036 165070 35 2 2 104 93 3 0.241 1.91 6.02 Intr + 165655 165738 84 1 0 57 92 54 0.165 1.64 6.03 Intr + 170545 170625 81 1 0 91 96 119 0.995 11.75 6.04 Intr + 176910 177038 129 0 0 74 103 136 0.996 13.51 6.05 Intr + 180296 180410 115 2 1 98 90 68 0.987 8.45 6.06 Intr + 185599 185763 165 0 0 74 98 42 0.927 3.96 6.07 Intr + 193557 193724 168 2 0 103 90 155 0.964 17.34 6.08 Intr + 195636 195773 138 2 0 65 80 46 0.806 2.16 6.09 Intr + 198859 198987 129 0 0 80 70 146 0.988 12.89 6.10 Intr + 199634 199820 187 1 1 87 78 240 0.998 22.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 158800 158674 127 1 1 74 102 298 0.841 30.12 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_1|264_aa MPAMKIYTQGLLGKALAGCGVIWFQETKRGGGFIEGMQENVKEVMGQKIVRWLHKEVCEQ LRRLFDIWGLIRWAGTLTSQRRPLPRRHGLAAAPLQARLALPAGGAADQSALRRHRRRHR RRRRRRQQHSAPRAPRAASSWPARRPRRARQPRSGASARPRAAATTPRAVLRVKNGAAKL PKPPAAAAAAAAEAPGAGAGMERSQSRLSLSASFEALAIYFPCMNSFDDEDAGGPFGSPG PISLLTRAYWRPLASSLEPTCHPP >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_1|795_bp atgcctgccatgaaaatctatacgcaggggttgcttggaaaagctctggctgggtgtggg gttatttggtttcaagaaacaaaacgaggagggggttttattgaaggaatgcaggagaat gtcaaagaggtcatgggccagaagatagtcagatggcttcataaagaggtgtgcgagcag ctgagaaggttgtttgacatctggggcctgattaggtgggcagggaccctcacgtcccag cggcgccccctcccccggcggcacggattggctgcggcgccgctccaagcccgcctcgcg ctgccggcggggggcgccgcggaccagagcgcgctgcgccgccaccgccgccgccaccgc cgccgccgccgccgccgccagcagcacagcgcgcctcgggctccgcgcgccgccagctcc tggcccgcccgccggccccgccgcgcccgccagccccgcagcggagcctcggcccggccc cgggccgccgccaccacgccgcgcgccgtactccgcgtcaagaatggggcggccaagctg cccaagccgcccgccgccgccgccgccgccgcggccgaggcgcccggcgccggcgcgggc atggagcgctcgcagagccgcctcagtctgtccgcctccttcgaggcgctcgccatctac ttcccgtgcatgaactccttcgacgacgaggacgcaggtgggccctttgggtcgccagga cccatctccctcctcacccgcgcctattggaggccccttgcctcgtccctggagccgacc tgccaccctccctga >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_2|158_aa MVVVSVTIIGGASDYGYQNSLAEMLVIGGAKDHRPSLPGKASLLCTGAARLADAAHSAKA VEEPELYLNFSESTQTQPPVLWLAEGTEEEGRGRGGGGGEEEEEEEKKKKKKKKKKKKKK KKKKKKKKKRKKKKEEEEEKKKGRGRGNGKRKRKRKQL >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_2|477_bp atggtggttgtgagtgtgacgattattggaggggctagcgattatgggtaccaaaattcc ttggcagagatgctggttattggaggggctaaagatcacagacctagccttccaggaaaa gcttccctgctctgcaccggggctgccagacttgcagatgctgcacactcagcaaaagca gtggaggagcctgagctatatttgaacttctctgagagcacacagacacagccccctgtg ctctggctggcggaggggactgaagaagaaggaagaggaagaggaggaggaggaggagaa gaagaggaagaagaagaaaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaagaagaggaagaagaagaaggaggaggaggaggagaag aagaaggggagggggagggggaatgggaagaggaagaggaagaggaagcagctctga >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_3|375_aa MAEGPERKWRSERKWSYRHRRRRFRSRGSRRRRRRCSHCRHRCRRLSSGLRKEEVISLGA SLGRVFVPCSPPTKLKFNQTRMSFLAPVPVFGSSWMSRVSRDLGRYKKCILPIRLWIEKQ EEHWTWSQGMTMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERNLLSVAYK NVVGARRSSWRVISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLELLDKYLIPNA TQPESKVFYLKMKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKKEMQPTHPIRLG LALNFSVFYYEILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTLIMQLLRDNLTLWT SENQGDEGDAGEGEN >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_3|1128_bp atggcagaaggaccggagcggaagtggcgatcggagcggaagtggagctaccgccaccgc cgccgccgattccggagccggggtagtcgccgccgccgccgccgctgcagccactgcagg caccgctgccgccgcctgagtagtgggcttaggaaggaagaggtcatctcgctcggagct tcgctcggaagggtctttgttccctgcagccctcccacgaaattaaaatttaatcaaacg cggatgagttttctagccccagtccccgtgtttggctcctcctggatgagcagagtctcc agagatttgggccgctacaaaaagtgcattttgcccattcggctgtggatagagaagcag gaagagcactggacttggagtcagggaatgacaatggataaaagtgagctggtacagaaa gccaaactcgctgagcaggctgagcgatatgatgatatggctgcagccatgaaggcagtc acagaacaggggcatgaactctccaacgaagagagaaatctgctctctgttgcctacaag aatgtggtaggcgcccgccgctcttcctggcgtgtcatctccagcattgagcagaaaaca gagaggaatgagaagaagcagcagatgggcaaagagtaccgtgagaagatagaggcagaa ctgcaggacatctgcaatgatgttctggagctgttggacaaatatcttattcccaatgct acacaaccagaaagtaaggtgttctacttgaaaatgaaaggagattattttaggtatctt tctgaagtggcatctggagacaacaaacaaaccactgtgtcgaactcccagcaggcttac caggaagcatttgaaattagtaagaaagaaatgcagcctacacacccaattcgtcttggt ctggcactaaatttctcagtcttttactatgagattctaaactctcctgaaaaggcctgt agcctggcaaaaacggcatttgatgaagcaattgctgaattggatacgctgaatgaagag tcttataaagacagcactctgatcatgcagttacttagggacaatctcactctgtggaca tcggaaaaccagggagacgaaggagacgctggggagggagagaactaa >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_4|746_aa MNASGSGYPLASLYVGDLHPDVTEAMLYEKFSPAGPILSIRVCRDVATRRSLGYAYINFQ QPADAERALDTMNFEMLKGQPIRIMWSQRDPGLRKSGVGNIFIKNLEDSIDNKALYDTFS TFGNILSCKVEDEGCTAGMQRLVVITFAIPFYLNPLSVLFLSFLPVAQVACDEHGSRGFG FVHFETHEAAQQAINTMNGMLLNDRKVFVGHFKSRREREAELGARALEFTNIYVKNLPVD VDEQGLQDLFSQFGKMLSVKVMRDNSGHSRCFGFVNFEKHEEAQKAVVHMNGKEVSGRLL YAGRAQKRVERQNELKRRFEQMKQDRLRRYQGVNLYVKNLDDSIDDDKLRKEFSPYGVIT SAKVMTEGGHSKGFGFVCFSSPEEATKAVTEMNGRIVGTKPLYVALAQRKEERKAILTNQ YMQRLSTMRTLSNPLLGSFQQPSSYFLPAMPQPPAQAAYYGCGPVTPTQPAPRWTSQPPR PSCASMVRPPVVPRRPPAHISSVRQASTQVPRTVPHTQRVANIGTQTTGPSGVGCCTPGR PLLPCKCSSAAHSTYRVQEPAVHIPGQEPLTASMLAAAPLHEQKQMIGERLYPLIHDVHT QLAGKITGMLLEIDNSELLLMLESPESLHAKVTDRAMGSCKQWWMGGFHRNPVDGGGSCI QDDRRGSGRAAGTPGYGAAEGVHALKPEKEILASMAAKRTVFLALSPKALQTLTYFPISL YLYLGSRKSTNMTSYQIQIHYHYLVE >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_4|2241_bp atgaacgccagcggttctggctacccgcttgcctcgctttacgtgggcgatctgcacccc gacgtgaccgaggccatgctctatgagaagttctctcccgccggccccatcctgtccatc cgcgtgtgccgcgatgtagccacccggcgctcgctgggctacgcctacatcaacttccag cagcccgcggacgcggagcgggcactggacacaatgaactttgagatgctcaaaggccag cctattcgcatcatgtggtcccagcgagacccaggacttcgcaagtcaggtgtgggcaac atcttcatcaagaacctggaggactccattgacaacaaggctttatatgataccttctcc acctttgggaacatcctctcttgcaaggtagaggatgaagggtgcacagcaggcatgcag agactggtcgtgatcacctttgccattcctttctacctgaaccctctgtcagtactcttc ctgtccttcctccctgtggcccaggtggcgtgtgacgagcatggctcccggggtttcggc tttgtccattttgagacccatgaggccgcacagcaggccatcaacaccatgaatgggatg ctgctgaatgaccgcaaagtctttgtgggtcacttcaagtctcgacgggagcgggaggcg gagctgggggcgcgggccctggagttcaccaacatctacgtgaagaacctcccggtggat gtggacgagcaaggcctgcaggacctcttctcccagtttgggaaaatgctgagtgtgaag gtgatgagggacaacagcggccactcgcggtgctttggctttgtcaactttgagaagcat gaggaagcccagaaggccgtggtccatatgaacgggaaggaggtgagcgggcggctgctg tacgcgggccgggcccaaaagcgcgtggagcggcagaatgaactgaagcgcaggtttgag cagatgaagcaggaccggctgaggcgttaccagggtgtgaacttgtatgtgaagaatctg gacgactccattgatgacgacaaactgaggaaagagttctctccctatggagtaattacc agtgcgaaggtgatgacagagggtggccacagcaaggggtttggctttgtgtgtttttcc tccccagaagaggcgacaaaggccgtgacagagatgaacgggcgcatcgtgggcaccaag ccactctacgtggcactggcccagcgcaaagaggagcggaaggccatcttgaccaaccag tacatgcagcgcctctccaccatgcggaccctgagcaaccccctcctgggctcctttcag cagccctccagctacttcctgcctgccatgccccagcctccagcccaggctgcatactat ggctgtggcccagtgacacccacccagcctgcccccaggtggacatcccagccacctaga ccttcctgtgcctcaatggtccggccaccagttgtgcctcggcgccccccggcccacatc agcagtgtcaggcaggcctccacccaggtgccacgcacggtgcctcatacccagagagta gccaacattggtactcagaccacaggacccagtggggtaggatgctgtacaccaggccgg ccgctcctgccgtgcaaatgttcctcagcagcacatagcacctatcgggtccaggagccg gctgtgcacatcccaggacaggagcccctgaccgcgtccatgctggctgcggcgcccctg catgagcaaaagcagatgattggggagcgtctctacccccttatccatgatgtccacacc cagctggctggcaagatcacgggcatgctgctggagattgacaactcagagctgttgctc atgctggagtctccagaatccctccatgccaaggtgacagacagggccatgggatcttgc aagcagtggtggatgggagggttccacagaaaccctgtggatggaggaggatcctgtatc caggatgatagacgaggcagtggccgtgctgcaggcacaccaggctatggagcagccgaa ggcgtacatgcactgaaaccagaaaaggaaatcctcgcttccatggctgccaaaaggaca gtgtttctggctctcagccctaaggccctgcaaactctaacttatttcccaattagtctg tatctatacttgggctctagaaaatccaccaacatgacctcctaccagatccaaatccac taccactacctagtggaataa >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_5|464_aa MAYLCNAICMTHMVTLDPVDQALPGATTATSVPGPQCDASCMMLFAVVPFNGIALAGHNL PDSGASLLPCVIHGVLATQGAQSMLPPRVQGSLSSKPSTRRGRDAPGLQVSQAPPPRRGF AVGRRYSPPALAPGRCAAPHGGGRKELPTRRPGHGMAPKFPDSVEELRAAGNESFRNGQY AEASALYGRALRVLQAQGSSDPEEESVLYSNRAACHLKDGNCRDCIKDCTSALALVPFSI KPLLRRASAYEALEKYPMAYVDYKTVLQIDDNVTSAVEGINRMTRALMDSLGPEWRLKLP SIPLVPVSAQKRWNSLPSENHKEMAKSKSKETTATKNRVPSAGDVEKARVLKEEGNELVK KGNHKKAIEKYSESLLCSNLESATYSNRALCYLVLKQYTEAVKDCTEALKLDGKNVKAFY RRAQAHKALKDYKSSFADISNLLQIEPRNGPAQKLRQEVKQNLH >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_5|1395_bp atggcatatttgtgtaatgccatttgcatgacccacatggtcaccctggaccccgtggat caggccctgcctggtgccaccaccgccacttctgtgccaggcccccagtgtgatgcaagt tgcatgatgctctttgctgtagtgccttttaatggcatagctttggcaggacataatctc cctgatagcggggccagcctccttccctgcgtcatccacggcgttctagcaactcaggga gcccagagcatgctgccgccccgcgtgcaagggagcctaagttccaaaccgagcacgcgc agagggcgggacgctccgggcctccaggtctcgcaggccccgccccctcgccgcgggttc gctgttgggcggagatattcgccgccggcgcttgcgcccggaaggtgtgccgcaccacac gggggaggaaggaaggagctcccaactcgccggcctggccacgggatggcccccaaattc ccagactctgtggaggagctccgcgccgccggcaatgagagtttccgcaacggccagtac gccgaggcctccgcgctctacggccgcgcgctgcgggtgctgcaggcgcaaggttcttca gacccagaagaagaaagtgttctctactccaaccgagcagcatgtcacttgaaggatgga aactgcagagactgcatcaaagattgcacttcagcactggccttggttcccttcagcatt aagcccctgctgcggcgagcatctgcttatgaggctctggagaagtaccctatggcctat gttgactataagactgtgctgcagattgatgataatgtgacgtcagccgtagaaggcatc aacagaatgaccagagctctcatggactcgcttgggcctgagtggcgcctgaagctgccc tcaatccccttggtgcctgtttcagctcagaagaggtggaattccttgccttcggagaac cacaaagagatggctaaaagcaaatccaaagaaaccacagctacaaagaacagagtgcct tctgctggggatgtggagaaagccagagttctgaaggaagaaggcaatgagcttgtaaag aagggaaaccataagaaagctattgagaagtacagtgaaagcctcttgtgtagtaacctg gaatctgccacgtacagcaacagagcactctgctatttggtcctgaagcagtacacagaa gcagtgaaggactgcacagaagccctcaagctggatggaaagaacgtgaaggcattctac agacgggctcaagcccacaaagcactcaaggactataaatccagctttgcagacatcagc aacctcctacagattgagcctaggaatggtcctgcacagaagttgcggcaggaagtgaag cagaacctacactaa >gi568815578f:44801534_45006435|GENSCAN_predicted_peptide_6|411_aa METVQLRNPPRRMASFITDVQCLPNGLHILLSSSEPDIGRQLKKLDEDSLTKQPEEVFDV LEKLGEGSYGSVYKAIHKETGQIVAIKQVPVESDLQEIIKEISIMQQCDSPHVVKYYGSY FKNTDLWIVMEYCGAGSVSDIIRLRNKTLTEDEIATILQSTLKGLEYLHFMRKIHRDIKA GNILLNTEGHAKLADFGVAGQLTDTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGI TAIEMAEGKPPYADIHPMRAIFMIPTNPPPTFRKPELWSDNFTDFVKQCLVKSPEQRATA TQLLQHPFVRSAKGVSILRDLINEAMDVKLKRQESQQREVDQDDEENSEEDEMDSGTMVR AVGDEMGTVRVASTMTDGANTMIEHDDTLPSQLGTMVINAEDEEEEGTMKX >gi568815578f:44801534_45006435|GENSCAN_predicted_CDS_6|1233_bp atggagacggtacagctgaggaacccgccgcgccggatggcatcttttattactgatgtt cagtgtcttcctaatggtcttcacatcctcctcagctcttcagaacctgacattgggagg cagctgaaaaagttggatgaagatagtttaaccaaacaaccagaagaagtatttgatgtc ttagagaaacttggagaagggtcctatggcagcgtatacaaagctattcataaagagacc ggccagattgttgctattaagcaagttcctgtggaatcagacctccaggagataatcaaa gaaatctctataatgcagcaatgtgacagccctcatgtagtcaaatattatggcagttat tttaagaacacagacttatggatcgttatggagtactgtggggctggttctgtatctgat atcattcgattacgaaataaaacgttaacagaagatgaaatagctacaatattacaatca actcttaagggacttgaataccttcattttatgagaaaaatacaccgagatatcaaggca ggaaatattttgctaaatacagaaggacatgcaaaacttgcagattttggggtagcaggt caacttacagataccatggccaagcggaatacagtgataggaacaccattttggatggct ccagaagtgattcaggaaattggatacaactgtgtagcagacatctggtccctgggaata actgccatagaaatggctgaaggaaagcccccttatgctgatatccatccaatgagggca atcttcatgattcctacaaatcctcctcccacattccgaaaaccagagctatggtcagat aactttacagattttgtgaaacagtgtcttgtaaagagccctgagcagagggccacagcc actcagctcctgcagcacccatttgtcaggagtgccaaaggagtgtcaatactgcgagac ttaattaatgaagccatggatgtgaaactgaaacgccaggaatcccagcagcgggaagtg gaccaggacgatgaagaaaactcagaagaggatgaaatggattctggcacgatggttcga gcagtgggtgatgagatgggcactgtccgagtagccagcaccatgactgatggagccaat actatgattgagcacgatgacacgttgccatcacaactgggcaccatggtgatcaatgca gaggatgaggaagaggaaggaactatgaaaann