GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:40:28 Sequence gi568815576r:30563440_30764003 : 200564 bp : 49.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 2877 2872 6 1.05 1.15 Term - 13690 13607 84 1 0 140 48 155 0.751 14.35 1.14 Intr - 15559 15398 162 1 0 100 105 325 0.940 35.57 1.13 Intr - 15864 15698 167 1 2 56 97 331 0.999 30.48 1.12 Intr - 16496 16312 185 1 2 81 99 244 0.970 24.23 1.11 Intr - 16739 16614 126 1 0 86 96 131 0.999 13.49 1.10 Intr - 17262 17132 131 0 2 61 91 211 0.996 18.19 1.09 Intr - 17662 17573 90 1 0 88 72 123 0.997 10.89 1.08 Intr - 17969 17895 75 2 0 114 78 75 0.971 8.81 1.07 Intr - 18205 18089 117 1 0 96 97 244 0.999 26.76 1.06 Intr - 21015 20926 90 0 0 121 86 106 0.986 13.89 1.05 Intr - 21278 21107 172 1 1 67 105 305 0.986 30.05 1.04 Intr - 23956 23847 110 1 2 123 101 244 0.994 28.28 1.03 Intr - 24735 24582 154 2 1 107 100 170 0.999 20.17 1.02 Intr - 25831 25752 80 1 2 55 36 144 0.981 4.25 1.01 Init - 34917 34897 21 0 0 60 113 10 0.295 0.53 1.00 Prom - 38239 38200 40 -6.06 2.00 Prom + 40257 40296 40 -6.16 2.01 Init + 43893 43956 64 2 1 88 89 85 0.978 7.96 2.02 Intr + 47432 47624 193 0 1 92 34 150 0.984 8.55 2.03 Intr + 49434 49603 170 0 2 49 85 208 0.841 16.19 2.04 Intr + 50910 51062 153 1 0 124 92 169 0.999 20.94 2.05 Intr + 51832 52034 203 2 2 93 75 189 0.860 17.10 2.06 Intr + 52162 52348 187 0 1 80 53 110 0.993 5.96 2.07 Intr + 53891 54056 166 0 1 69 57 180 0.391 12.02 2.08 Intr + 59529 59644 116 0 2 106 113 56 0.778 9.99 2.09 Intr + 72998 73630 633 0 0 46 106 737 0.734 63.41 2.10 Term + 83159 83592 434 0 2 103 53 485 0.997 41.96 2.11 PlyA + 87061 87066 6 1.05 3.07 PlyA - 87760 87755 6 1.05 3.06 Term - 88186 88175 12 1 0 105 32 9 0.239 -4.80 3.05 Intr - 91239 90834 406 2 1 36 -19 360 0.544 14.45 3.04 Intr - 100536 100002 535 1 1 81 48 806 0.003 67.88 3.03 Intr - 115705 115592 114 2 0 24 53 133 0.625 3.82 3.02 Intr - 116189 115994 196 2 1 67 54 141 0.763 7.59 3.01 Init - 120349 120344 6 1 0 71 110 0 0.516 1.49 3.00 Prom - 125210 125171 40 -2.16 4.05 PlyA - 125407 125402 6 1.05 4.04 Term - 131258 131121 138 2 0 98 45 116 0.926 6.26 4.03 Intr - 131926 131437 490 0 1 40 3 433 0.507 23.01 4.02 Intr - 132212 132009 204 1 0 42 64 128 0.765 4.12 4.01 Init - 147017 146953 65 2 2 90 80 94 0.971 9.42 4.00 Prom - 147162 147123 40 -5.86 5.04 PlyA - 148507 148502 6 1.05 5.03 Term - 153587 153412 176 0 2 40 49 153 0.964 4.52 5.02 Intr - 155008 154982 27 2 0 128 116 -4 0.893 4.49 5.01 Init - 158151 158145 7 0 1 73 97 5 0.590 0.72 5.00 Prom - 160962 160923 40 -6.16 6.00 Prom + 163979 164018 40 -4.16 6.01 Init + 167810 167895 86 1 2 82 76 49 0.690 3.41 6.02 Intr + 177722 178063 342 2 0 86 100 261 0.457 21.65 6.03 Intr + 180284 180396 113 2 2 91 78 96 0.683 8.92 6.04 Intr + 189665 189714 50 0 2 31 78 34 0.045 -4.90 6.05 Intr + 193426 193585 160 0 1 110 32 91 0.120 5.26 6.06 Intr + 199162 199241 80 2 2 97 66 43 0.066 2.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 104164 104796 633 0 0 69 37 225 0.922 11.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_1|587_aa MKLPTFKYERGSATNYITRNKARKKLQLSLADFRRLCILKGIYPHEPKHKKKVNKGSTAA RTFYLIKDIRFLLHEPIVNKFREYKVFVRKLRKAYGKSEWNTVERLKDNKPNYKLDHIIK ERYPTFIDALRDLDDALSMCFLFSTFPRTGKCHVQTIQLCRRLTVEFMHYIIAARALRKV FLSIKGIYYQAEVLGQPIVWITPYAFSHDHPTDVDYRVMATFTEFYTTLLGFVNFRLYQL LNLHYPPKLEGQAQAEAKAGEGTYALDSESCMEKLAALSASLARVVVPATEEEAEVDEFP TDGEMSAQEEDRRKELEAQEKHKKLFEGLKFFLNREVPREALAFIIRSFGGEVSWDKSLC IGATYDVTDSRITHQIVDRPGQQTSVIGRCYVQPQWVFDSVNARLLLPVAEYFSGVQLPP HLSPFVTEKEGDYVPPEKLKLLALQRGEDPGNLNESEEEEEEDDNNEGDGDEEGENEEEE EDAEAGSEKEEEARLAALEEQRMEGKKPRVMAGTLKLEDKQRLAQEEESEAKRLAIMMMK KREKYLYQKIMFGKRRKIREANKLAEKRKAHDEAVRSEKKAKKARPE >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_1|1764_bp atgaagctgccaaccttcaagtatgaacgaggctcggccaccaactacatcacccggaac aaagcccggaagaagctccagctgagcttggctgactttaggcggctgtgcattctgaag ggcatttatccccatgaacccaaacacaagaagaaggttaacaagggttctacagcagcc cgaacgttttaccttatcaaagacatcaggtttctcctccacgaacccattgtcaacaag ttccgtgaatacaaggtgttcgtccggaagctccggaaggcttatgggaagagcgagtgg aacactgtagagcgtttaaaggacaataagcccaactacaaactcgaccacatcatcaag gaacggtatcccacgttcatcgatgccctgcgggacctggacgatgccctctccatgtgc ttcctgttttccaccttcccgcggactggcaagtgccacgtgcagaccattcagctgtgc cgccggctcactgtggagttcatgcactacattatcgctgcccgtgccctgcgcaaggtc ttcctgtccatcaaaggcatttactaccaggccgaggtactggggcagcccatcgtgtgg atcactccctatgccttctcccatgaccacccgacagacgtggactacagggtcatggcc accttcaccgagttctacaccacgctgctgggctttgtcaacttccgcctttaccagttg ctcaacctccactatcccccgaagctcgagggtcaggcccaagcagaggcaaaggccggt gagggcacctacgcgttggactccgagagttgtatggagaaactggcagccctcagtgcc agcctggcccgcgtggtggtgcctgccacagaggaggaggccgaggtggatgagtttccc accgatggggagatgtcagcgcaggaggaagaccgcaggaaggagctggaggcgcaggag aagcacaagaagctttttgagggcctgaagttcttcctgaaccgagaggtgccccgtgag gccctggccttcatcatcaggagttttggtggggaagtgtcctgggacaaatctttgtgc attggggccacctatgacgtcacagactcccgcatcacccatcagattgtcgaccggcct gggcagcagacctcagtcattggcaggtgctacgtgcagccccagtgggtgtttgactca gtgaacgccaggctccttctccccgtggcagagtacttctctggggtgcagctgccccca cacctttcaccctttgtgaccgagaaggaaggagattacgttccacctgagaagctgaag ctgctggctctgcagcggggagaggacccaggaaacctgaatgagtcagaagaggaggag gaagaggacgacaacaacgaaggtgatggtgatgaagagggagaaaatgaggaggaggag gaagatgcagaggctggttcagaaaaggaggaagaggcccggctggcagccctggaagag cagaggatggaggggaagaagcccagggtgatggcaggcaccttgaagctggaggataag cagcggctggcccaggaggaggagagtgaggccaagcgcctggccattatgatgatgaag aagcgggagaagtacctgtaccagaagatcatgtttggcaagaggcgaaaaatccgagag gccaacaagctggcggagaagcggaaagcccacgatgaggcggtgaggtctgagaagaag gccaagaaggcaaggccggagtga >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_2|772_aa MRHLGAFLFLLGVLGALTEMCEIPEMDSHLVEKLGQHLLPWMDRLSLEHLNPSIYVGLRL SSLQAGTKEDLYLHSLKLGYQQCLLGSAFSEDDGDCQGKPSMGQLALYLLALRANCEFVR GHKGDRLVSQLKWFLEDEKRAIGHDHKGHPHTSYYQYGLGILALCLHQKRVHDSVVDKLL YAVEPFHQGHHSVAHCMFCPPLQDTAAMAGLAFTCLKRSNFNPGRRQRITMAIRTVREEI LKAQTPEGHFGNVYSTPLALQFLMTSPMRGAELGTACLKARVALLASLQDGAFQNALMIS QLLPVLNHKTYIDLIFPDCLAPRVMLEPAAETIPQTQEIISVTLQVLSLLPPYRQSISVL AGSTVEDVLKKAHELGGFTYETQASLSGPYLTSVMGKAAGEREFWQLLRDPNTPLLQASL VRMCRCPPEHHDGRMTSAEVGAAAGGAQAAGPPEWPPGSPQALRQPGRARVAMAALVWLL AGASMSSLNKWIFTVHGFGRPLLLSALHMLVAALACHRGARRPMPGGTRCRVLLLSLTFG TSMACGNVGLRAVPLDLAQLVTTTTPLFTLALSALLLGRRHHPLQLAAMGPLCLGAACSL AGEFRTPPTGCGFLLAATCLRGLKSVQQSALLQEERLDAVTLLYATSLPSFCLLAGAALV LEAGVAPPPTAGDSRLWACILLSCLLSVLYNLASFSLLALTSALTVHVLGNLTVVGNLIL SRLLFGSRLSALSYVGIALTLSGMFLYHNCEFVASWAARRGLWRRDQPSKGL >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_2|2319_bp atgaggcaccttggggccttcctcttccttctgggggtcctgggggccctcactgagatg tgtgaaataccagagatggacagccatctggtagagaagttgggccagcacctcttacct tggatggaccggctttccctggagcacttgaaccccagcatctatgtgggcctacgcctc tccagtctgcaggctgggaccaaggaagacctctacctgcacagcctcaagcttggttac cagcagtgcctcctagggtctgccttcagcgaggatgacggtgactgccagggcaagcct tccatgggccagctggccctctacctgctcgctctcagagccaactgtgagtttgtcagg ggccacaagggggacaggctggtctcacagctcaaatggttcctggaggatgagaagaga gccattgggcatgatcacaagggccacccccacactagctactaccagtatggcctgggc attctggccctgtgtctccaccagaagcgggtccatgacagcgtggtggacaaacttctg tatgctgtggaacctttccaccagggccaccattctgtggctcattgcatgttctgtccc ccacttcaagacacagcagccatggcaggcttggcattcacctgtctgaagcgctcaaac ttcaaccctggtcggagacaacggatcaccatggccatcagaacagtgcgagaggagatc ttgaaggcccagacccccgagggccactttgggaatgtctacagcaccccattggcatta cagttcctcatgacttcccccatgcgtggggcagaactgggaacagcatgtctcaaggcg agggttgctttgctggccagtctgcaggatggagccttccagaatgctctcatgatttcc cagctgctgcccgttctgaaccacaagacctacattgatctgatcttcccagactgtctg gcaccacgagtcatgttggaaccagctgctgagaccattcctcagacccaagagatcatc agtgtcacgctgcaggtgcttagtctcttgccgccgtacagacagtccatctctgttctg gccgggtccaccgtggaagatgtcctgaagaaggcccatgagttaggaggattcacatat gaaacacaggcctccttgtcaggcccctacttaacctccgtgatggggaaagcggccgga gaaagggagttctggcagcttctccgagaccccaacaccccactgttgcaagcctcactg gtgcggatgtgccgctgcccgccggagcaccatgatggcaggatgacctcagccgaagta ggagcagcagctggtggtgctcaggcggctgggccccccgagtggccccctggcagccct caggccctccggcagcctggccgggcccgagtggccatggcagcactggtgtggctgctg gcgggagccagcatgtcaagcctcaacaagtggatcttcacagtgcacggctttgggcgg cccctgctgctgtcggccctgcacatgctggtggcagccctggcatgccaccggggggca cggcgccccatgccaggcggcactcgctgccgagtcctactgctcagtctcacctttggc acgtccatggcctgcggcaacgtgggcctaagggctgtgcccctggacctggcacaactg gttactaccaccacacctctgttcaccctggccctgtcggcgctgctgctgggccgccgc caccacccacttcagttggccgccatgggtccgctctgcctgggggccgcctgcagcctg gctggagagttccggacaccccctaccggctgtggcttcctgctcgcagccacctgcctc cgcggactcaagtcggttcagcaaagtgccctgctgcaggaggagaggctggacgcggtg accctgctttacgccacctcgctgcccagcttctgcctgctggcgggtgcagccctggtg ctggaggctggcgttgccccaccgcccactgctggcgactctcgcctctgggcctgcatc ctgctcagctgcctcctgtctgttctctataacctggccagcttctccctgctggccctc acctctgccctcaccgtccacgtcctgggcaacctcaccgtggtgggcaacctcatcctg tcccggctgttgtttggcagccgcctcagtgccctcagctacgtgggcatcgcactcact ctttcaggaatgttcctttaccacaactgcgagttcgtggcctcctgggctgcccgtcgg gggctgtggcggagggaccagcccagcaagggtctttga >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_3|422_aa MRTSSSPQPGVGSDHPNIPGCPTWGDKDPGEGTLEHVKAEDTLSQDTQERRTSKRSSAGK LSTGKNQDHEADSDEEEEDKCKKLTSDSECEEQLPEEMKERKTEKIQFRQPSVSGLSQIT KSLYISNGVAANNKLMLSSNQITMVINVSVEVVNTLYEDIQYMQVPVADSPNSRLCDFFD PIADHIHSVEMKQGRTLLHCAAGVSRSAALCLAYLMKYHAMSLLDAHTWTKSCRPIIRPN SGFWEQLIHYEFQLFGKNTVHMVSSPVGMIPDIYEKEVRLMIPLKKLKYLAFLHKWMNSN PSRGTYHFWAPTTSGSPAASSGGPRGMLPRETKEARPPKTASRCLTASHHPMTKKQVVVP AALQVVHLKPTRKFAYLGQLDHKVGWKYQTVTATLEEKRKEKAKIHYQNKKQLMGWPGAG PD >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_3|1269_bp atgaggacttcgtcgtccccacaacctggtgttgggtctgatcaccccaacattcctggc tgcccaacgtggggcgacaaagaccccggtgaaggaacactagagcatgtgaaagcggag gacacattgtcacaggacacccaagaacgtcgaacgtctaaaagaagctcggcaggaaag ctgagcactgggaagaaccaggatcatgaggcagattcagatgaggaagaggaggacaag tgtaaaaaactaacttcagattctgagtgtgaggaacagctaccggaggagatgaaagaa aggaaaactgaaaaaattcagttccggcagccctcagtcagcggcctctcgcagataacc aaaagcctgtatatcagcaatggtgtggccgccaacaacaagctcatgctgtctagcaac cagatcaccatggtcatcaatgtctcagtggaggtagtgaacaccttgtatgaggatatc cagtacatgcaggtacctgtggctgactcccctaactcacgtctctgtgacttctttgac cctattgctgaccatatccacagcgtggagatgaagcagggccgtactttgctgcactgt gctgctggtgtgagccgctcagctgccctgtgcctcgcctacctcatgaagtaccacgcc atgtccctgctggacgcccacacgtggaccaagtcatgccggcccatcatccgacccaac agcggcttttgggagcagctcatccactatgagttccaattgtttggcaagaacactgtg cacatggtcagttccccagtgggaatgatccctgacatctatgagaaggaagtccgtttg atgattccactaaagaagttgaagtacctggccttcctccacaagtggatgaacagcaac ccttcccgaggcacctaccacttctgggcccccaccacttccgggtccccagccgcatct tctggcggaccccgaggcatgctgccccgcgagaccaaggaagccaggccgcccaagacc gcctcaaggtgtttaacggcatcccaccaccctatgacgaaaaagcaggtggtggttcct gctgccctccaggttgtgcatctgaagcctacaagaaagtttgcctacctggggcagctg gatcacaaagttggctggaagtaccagacagtgacagccaccctggaggagaagaggaag gagaaggccaagatccactaccagaataagaaacagctcatgggatggccgggcgcgggt ccagactaa >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_4|298_aa MSIFKGNDSSGKHIKHADIYERLVAVSANHGGPPSPSLQHHHPPTPMSHPGASIPVVREQ TIAQHEPAALVALQIVGPLEKPALEAVQRQRGRGFNAGSDPDSGLRKVALPTARESGSDA ALVKGPAPTPELDSDPGRDPCSSSDGCPAPGSGSDVVSDTGSDLGTASETASDTCSDSGP RSGSGTGWGWGLGSGPEPDVEALMPGAAVWHDRQGTTVNSDESPRERPPQPPPRLGAAAF PIEPADPRAAERPVRPVEGALTTPVGGRAWWSVNSEGDGALQATVSASAQPCQALEEQ >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_4|897_bp atgagcatctttaagggtaatgacagtagtgggaaacacatcaaacatgctgatatctat gagagactagtggcagtgtccgccaaccatggaggccctccatcaccatccctgcagcat caccaccctccaacccccatgtcccaccctggcgcttccatacctgtagtaagagagcaa accattgcccagcacgaaccagcggcgctggtagcccttcagatagttggtccacttgag aagccagccctcgaagctgtccagaggcaaagaggcaggggctttaacgctggcagcgat cctgactcgggtctgagaaaggtcgcgctccccaccgcccgggagagcggctccgatgcg gccttagtgaagggcccagcccctacacctgagcttgactctgaccccggccgcgacccc tgcagcagttccgatggctgcccagcccctggctccggctcagacgtcgtctcggacaca ggttccgatcttggcactgcctccgaaactgcctccgacacctgttctgacagcggtccc cgctccggttcgggcacgggctggggctggggcttgggctccggcccggagccggacgtg gaagcgctcatgcccggcgccgccgtgtggcacgacaggcaggggacaaccgtgaacagc gacgagagcccgcgggagcggccgccacagccgccgcctcggctcggagccgccgctttc cccatagagccggccgacccgcgcgcggccgagcggccagtccggccggtggaaggagct ctgacaacacctgtgggcggaagagcttggtggtcggtcaactccgaaggggatggcgcg cttcaggccacggtctctgcttcggcccagccgtgccaggcgctggaggaacaatga >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_5|69_aa MADGFPSNFQKEKTIGIKFHDIGFGNDFLDMTPKAQATKAKTDKWDSIEGHNLSSLKDTI NKVKRKLME >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_5|210_bp atggcagatggtttcccatcaaattttcagaaagagaaaaccataggcataaagtttcat gacattggatttggcaatgatttcttggatatgacaccaaaagcacaggcaacaaaagca aaaacagacaaatgggactccatcgaaggacacaacttaagttccttgaaggacacaatc aacaaagtaaaaaggaaacttatggaatag >gi568815576r:30563440_30764003|GENSCAN_predicted_peptide_6|277_aa MASGRGEIKVTTQLIPLLDETSQRATETRNQGEMAHTCRGTINLSTAHIDTEDSCGILLT SGARSYHLKASSEVDRQQWITALELAKAKAVRVMNTHSGSEHLGQRVCIWWCHESGGSGN CKPLVGSGGQETQASVDQSGRSRNFLKIMQQVAVMGLVIRQQFSYDSQCEGFLLQLAGCG ALYELKLQEQGVAAPTEAGVIGPFAEHTYTCYAATFCSDPPSHADNTPQTQGICCVVTPS NHADHTPQTQDPSSFCCLHWHTILLPASVNAVPLVGK >gi568815576r:30563440_30764003|GENSCAN_predicted_CDS_6|831_bp atggcaagtggcagaggagagatcaaagtgaccacccagctcattcccttgctggatgag acttcacagcgagctacagagacacgaaatcagggtgaaatggcccacacgtgccgtgga accatcaacctgtccaccgcgcacattgacacggaggactcttgtggtatcttgctgacc agtggggccaggagctaccacctcaaggccagctcagaggtggaccggcagcagtggatc accgccctggagctggccaaggccaaggctgtccgcgtgatgaacactcattcaggtagt gagcacttgggacagcgggtgtgtatatggtggtgtcatgagtctggaggaagtgggaat tgcaagccactggtgggcagtggtggccaggagacccaggccagcgtggatcaatctgga aggtcaaggaacttcttgaaaataatgcagcaggtggcagtcatgggattagtcatccgg caacagttttcctacgacagccagtgcgaggggtttcttttgcagttggcagggtgcgga gccctatatgaactcaaacttcaggagcaaggtgtggcagctcccacagaggcaggggtg atagggccctttgctgagcacacgtacacatgctatgcagccacgttctgcagtgaccct cccagccatgcagacaacactcctcaaacccagggtatctgctgtgtggtgaccccttcc aaccatgcagaccacactcctcaaacgcaggacccatccagcttctgctgtcttcactgg cacaccatcttgctgccagccagcgtgaatgctgtacctcttgttggcaag