GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:49:35

Sequence gi568815597f:27673308_27922326 : 249019 bp : 45.13% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    432    427    6                               1.05
 1.02 Term -  14111  13849  263  0  2   78   47   268 0.351  17.39
 1.01 Init -  16366  16309   58  1  1   58   70    48 0.713   1.47
 1.00 Prom -  21600  21561   40                              -1.16

 2.00 Prom +  24450  24489   40                              -4.46
 2.01 Init +  52774  52854   81  0  0  103   91   149 0.981  17.67
 2.02 Intr +  54165  54229   65  2  2   99   65    89 0.996   5.22
 2.03 Intr +  59296  59350   55  1  1   80  127    34 0.993   5.48
 2.04 Intr +  60724  60876  153  0  0   91   94   121 0.995  13.27
 2.05 Intr +  65878  66075  198  0  0    3   35   145 0.191   0.25
 2.06 Intr +  66688  67104  417  0  0   33    5   263 0.069   6.72
 2.07 Intr +  71347  71504  158  0  2  138   91    67 0.840  10.81
 2.08 Intr +  81888  82023  136  0  1   28   86   155 0.957   9.87
 2.09 Intr +  86219  86320  102  1  0   95   62    87 0.966   7.17
 2.10 Intr +  95624  95722   99  1  0   41   93    53 0.105   1.41
 2.11 Intr +  99917 100171  255  1  0   77   68   233 0.166  17.84
 2.12 Intr + 119634 119687   54  2  0  101   91     4 0.257   1.18
 2.13 Intr + 128371 128508  138  0  0   67   84    74 0.280   5.56
 2.14 Intr + 136939 136982   44  0  2   64   90    18 0.140  -3.36
 2.15 Intr + 138856 138961  106  1  1   83   78   169 0.892  15.62
 2.16 Intr + 144544 144616   73  0  1  121   76    63 0.988   7.38
 2.17 Term + 146343 146473  131  1  2  129   47    72 0.093   5.54
 2.18 PlyA + 146618 146623    6                               1.05

 3.00 Prom + 147395 147434   40                              -2.46
 3.01 Init + 157529 157584   56  1  2  100  105    61 0.965   9.34
 3.02 Intr + 159449 159509   61  2  1  111   64    11 0.554  -0.26
 3.03 Intr + 165392 165545  154  1  1   23   86   115 0.977   4.45
 3.04 Intr + 167707 168067  361  2  1   75   46   173 0.778   6.08
 3.05 Intr + 169140 169185   46  0  1  113   36    14 0.769  -2.89
 3.06 Intr + 169879 170023  145  0  1   41   94   167 0.908  12.46
 3.07 Intr + 173721 173785   65  1  2  109   97    -2 0.954   1.24
 3.08 Term + 176786 177139  354  1  0   34   40   264 0.987  10.69
 3.09 PlyA + 178341 178346    6                               1.05

 4.00 Prom + 191300 191339   40                              -4.46
 4.01 Init + 199265 199358   94  1  1  109   84   153 0.994  17.33
 4.02 Intr + 203281 203421  141  2  0   79   86   189 0.787  18.12
 4.03 Intr + 206337 206747  411  1  0  135   79   323 0.961  30.26
 4.04 Intr + 208664 209736 1073  0  2   95   74  1049 0.523  94.15
 4.05 Intr + 211988 212144  157  1  1   99   95    91 0.873  10.48
 4.06 Term + 214338 214390   53  1  2   77   55    60 0.415  -0.81
 4.07 PlyA + 216193 216198    6                               1.05

 5.10 PlyA - 217767 217762    6                               1.05
 5.09 Term - 218940 218856   85  2  1  150   36   104 0.999   8.53
 5.08 Intr - 220799 220705   95  2  2   67  108    43 0.991   2.96
 5.07 Intr - 221090 220983  108  2  0   40   91    52 0.689   1.18
 5.06 Intr - 223814 223698  117  2  0   70  109   106 0.977  11.56
 5.05 Intr - 224400 224326   75  0  0   66  103    29 0.772   1.91
 5.04 Intr - 233734 233621  114  1  0  103   95   131 0.989  15.94
 5.03 Intr - 233975 233874  102  2  0  107   91    39 0.985   6.47
 5.02 Intr - 240862 240756  107  2  2   71   39   152 0.970   8.53
 5.01 Intr - 241434 241115  320  2  2   65   33   254 0.711  13.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  87188  87274   87  1  0   96   48    75 0.926   1.96
S.002 Intr + 146343 146425   83  1  2  129   98    77 0.860  12.16

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:27673308_27922326|GENSCAN_predicted_peptide_1|106_aa
MKLKESAVSDLGAKPSSDFRVLLSDWPGDNTLFQLKFMVKQLEKPDKKAEKHSYANQAKV
KKSLQQKNVECTRVYTKNTICKENSSVNWLYMASAWTQWPPRCRQL

>gi568815597f:27673308_27922326|GENSCAN_predicted_CDS_1|321_bp
atgaaattgaaagagagtgctgtgtcagaccttggtgctaaaccaagttcagatttcagg
gtgcttctgtcagactggccaggggacaataccctgttccagttgaaatttatggtgaag
cagctggagaaaccagacaagaaggcagaaaagcactcctacgccaaccaggccaaagtg
aagaagtcccttcagcagaaaaatgtagagtgcacccgtgtgtacaccaagaacaccatc
tgcaaggagaacagcagcgtgaactggctctacatggcatccgcctggacacagtggccc
ccgaggtgcagacagctgtga

>gi568815597f:27673308_27922326|GENSCAN_predicted_peptide_2|754_aa
MAALYACTKCHQRFPFEALSQGQQLCKECRIAHPVVKCTYCRTEYQQESKTNTICKKCAQ
NVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQCAFDRKDDRKKDA
GGKAGAPIVCMLNQGLKHVSEGCVFAYCQVEEKPYWKDPNNDFRKNLEVTTVPTLLKYGT
PQKLQPGQFRGQVPPYSRLWVHLDVMDGHFVPNVTFGYPVVESFQKQLVQDLFFDIHVMV
SKLEQWVKSIAIAEASQYTFHLEATKYSEALIEDIWENGMKVGLTIKPGTTVEYLAPWTN
QIDTALVITVEPGFGRQKFMDDMVDGKLLCWLCTLSYKRVLQKTKEQRKHLSSSSRAGHQ
EKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVIIAQLKEEVATLKKMLHQKDQMILEKEK
KITELKADFQYQESQMRAKMNQMEKTHKEVTEQLQFGTSKNKYCYELHNNTKLETMQTSI
NKRMDNQTFLRQPAGPGWRLLPVGERCRASRSQLLVMSYGPLDMYRNPGPSGPQLRDFSS
IIQTCSGNIQRISQASEPGYRAGGRELSRGQAWVRLLCFGGKSQKLRYKEQRQQRLQKER
LMNDFSAALNNFQAVQRRVSEKEKESIARARAGSRLSAEERQREEQLVSFDSHEEWNQMQ
SQEDEVAITEQDLELIKERETAIRQLEADILDVNQIFKDLAMMIHDQGDLIDSIEANVES
SEVHVERATEQLQRAAYYQVKAGTKESHSVLQTF

>gi568815597f:27673308_27922326|GENSCAN_predicted_CDS_2|2265_bp
atggcggcgctctacgcctgcaccaagtgccaccagcgcttccccttcgaggcgctgtct
caggggcagcagctgtgcaaggaatgtcggattgcacaccctgttgtgaagtgcacctac
tgcaggactgagtaccagcaggagagtaaaaccaatacaatatgcaagaaatgtgctcag
aacgtgcagttgtatggaacgcccaaaccttgtcagtattgcaacataattgcagcattt
attgggaataaatgccagcgctgcacaaattcagaaaagaagtatggaccaccctattct
tgtgaacagtgcaagcagcagtgtgcatttgacaggaaagatgatagaaagaaggatgct
ggaggaaaagctggtgcccctattgtatgcatgctgaaccaggggctgaagcatgttagt
gaaggatgtgtgttcgcctactgccaagtagaagaaaagccttattggaaagatccaaat
aatgacttcagaaaaaacttggaagtaaccacagtgcctacactacttaaatatggaaca
cctcaaaaactgcaacctggccagtttaggggtcaagtgcctccgtattctagactctgg
gtccacctggatgtcatggatgggcactttgttcccaacgtcacctttggttatcctgtg
gtagaaagctttcaaaagcagctagtccaggaccttttctttgacatacatgtgatggtg
tccaagctggaacagtgggtaaaatcaatagctatagcagaagccagtcagtacaccttt
catcttgaggctactaagtactcagaggctttgattgaagacatttgggagaatgggatg
aaggttggccttaccatcaaaccaggaactacagttgagtatttggcaccatggactaat
caaatagatacggccttggttatcacagtggaacctgggtttggaaggcagaaattcatg
gatgatatggtagatgggaaattgctgtgctggctgtgcacactttcatacaaacgggtc
cttcagaagaccaaagagcagaggaaacacctgagtagctcttctcgtgctggccaccag
gagaaggagcagtatagtcgcctgagtggtggtggccattataacagcttctccccagac
ctggctctggactcaccaggcactgaccactttgtcatcattgcccaactgaaggaagaa
gtggctaccctgaagaagatgttgcatcaaaaggatcaaatgattttagagaaagagaag
aagattacagagttgaaggctgattttcagtaccaggaatcgcagatgagagccaaaatg
aaccagatggagaaaacccacaaagaagtcacagaacaactgcagtttggaactagtaaa
aataaatactgctatgaacttcataataacaccaaattggaaacaatgcaaacatccatc
aataagagaatggataatcaaacttttctccgtcagcctgcgggtcccggctggcggctg
cttccggtaggagagcggtgtagagcgagcaggtctcagctcctcgtcatgtcatacggt
cccttagacatgtaccggaacccggggccctcggggccccagctccgggacttcagcagc
atcatccagacgtgcagcggcaacatccagcggatcagccaagccagtgagccggggtac
cgagctggggggcgggagctgtcccggggacaggcctgggtaaggcttctttgttttgga
gggaagagccagaaactcaggtataaagagcagcgccagcagagacttcagaaggaacgc
ctcatgaatgacttctctgcagccttaaacaatttccaggctgtgcagagaagggtatct
gaaaaggaaaaggagagtattgccagagcaagagctggatctcgtctttctgcagaagag
aggcaaagagaggagcagctggtctcatttgacagccatgaggagtggaaccagatgcag
agccaggaggatgaggtggccatcactgagcaggatttggaacttattaaagaaagagaa
acggcaattcggcagctggaggctgacattttggatgtcaatcagatatttaaagatttg
gccatgatgatccatgaccagggtgatctgattgatagcatagaagccaatgtggaaagc
tcagaggtgcacgtcgaaagagccactgaacagttacagcgagctgcttactatcaggta
aaagcgggtaccaaagaaagtcactctgtgttgcagactttttag

>gi568815597f:27673308_27922326|GENSCAN_predicted_peptide_3|413_aa
MAAAANSGSSLPLFDCPTWAGKPPPGLHLDVVKGDKLIEKLIIDEKKYYLFGRNPDLCDF
TIDHQSCSRVHAALVYHKHLKRVFLIDLNSTHGTFLGHIRLEPHKPQQIPIDSTVSFGAS
TRAYTLREKPQTLPSAVKGDEKMGGEDDELKGLLGLPEEETELDVIPCLCHCFVFGDPGF
WSIQLFYVAGNASDLAAGGSLESSNGPLAIWPPYPGCCYTSMAFSSNLTEFNTAHNKRIS
TLTIEEGNLDIQRPKRKRKNSRVTFSEDDEIINPEDVDPSVGRFRNMVQTAVVPVKKKRV
EGPGSLGLEESGSRRMQNFAFSGGLYGGLPPTHSEAGSQPHGIHGTALIGGLPMPYPNLA
PDVDLTPVVPSAVNMNPAPNPAVYNPEAVNEPKKKKYAKEAWPGKKPTPSLLI

>gi568815597f:27673308_27922326|GENSCAN_predicted_CDS_3|1242_bp
atggcggcagccgcgaactccggctctagcctcccgctgttcgactgcccaacctgggca
ggtaagccccctcccggtttacatctggatgtagtcaaaggagacaaactaattgagaaa
ctgattattgatgagaagaagtattacttatttgggagaaaccctgatttgtgtgacttt
accattgaccaccagtcttgctctcgggtccatgctgcacttgtctaccacaagcatctg
aagagagttttcctgatagatctcaacagtacacacggcactttcttgggtcacattcgg
ttggaacctcacaagcctcagcaaattcccatcgattccacggtctcatttggcgcatcc
acaagggcatacactctgcgcgagaagcctcagacattgccatcggctgtgaaaggagat
gagaagatgggtggagaggatgatgaactcaagggcttactggggcttccagaggaggaa
actgagcttgatgtaattccctgtttatgtcattgttttgtctttggggaccctggtttt
tggagtatacagctgttctatgtagctggaaatgccagtgacttagcagcagggggttct
cttgagtccagtaatgggcctttggcaatctggccaccttatccaggctgctgctatact
tccatggcttttagttcaaacctgacagagttcaacactgcccacaacaagcggatttct
acccttaccattgaggagggaaatctggacattcaaagaccaaagaggaagaggaagaac
tcacgggtgacattcagtgaggatgatgagatcatcaacccagaggatgtggatccctca
gttggtcgattcaggaacatggtgcaaactgcagtggtcccagtcaagaagaagcgtgtg
gagggccctggctccctgggcctggaggaatcagggagcaggcgcatgcagaactttgcc
ttcagcggaggactctacgggggcctgccccccacacacagtgaagcaggctcccagcca
catggcatccatgggacagcactcatcggtggcttgcccatgccatacccaaaccttgcc
cctgatgtggacttgactcctgttgtgccgtcagcagtgaacatgaaccctgcaccaaac
cctgcagtctataaccctgaagctgtaaatgaacccaagaagaagaaatatgcaaaagag
gcttggccaggcaagaagcccacaccttccttgctgatttga

>gi568815597f:27673308_27922326|GENSCAN_predicted_peptide_4|642_aa
MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV
VCENPKTSQTMELAPNFQGYFTPLNTPQSYETLEELVSATTQSSKQLPTCFMSTHRIVTE
GRVVTEDQLLMLEAVVMHLGIRSARCVLGMEGQQVILHLPLSQKGPFWTWEPSAPRTLLQ
VLQDPALKDLVLTCPTLPWHSLILRPQYEIQAIMHMRRTIVKIPSTLEVDVEDVTASSRH
VHFIKPLLLSEVLAWEGPFPLSMEILEVPEGRPIFLSPWVGSLQKGQRLCVYGLASPPWR
VLASSKGRKVPRHFLVSGGYQGKLRRRPREFPTAYDLLGAFQPGRPLRVVATKDCEGERE
ENPEFTSLAVGDRLEVLGPGQAHGAQGSDVDVLVCQRLSDQAGEDEEEECKEEAESPERV
LLPFHFPGSFVEEMSDSRRYSLADLTAQFSLPCEVKVVAKDTSHPTDPLTSFLGLRLEEK
ITEPFLVVSLDSEPGMCFEIPPRWLDLTVVKAKGQPDLPEGSLPIATVEELTDTFYYRLR
KLPACEIQAPPPRPPKNQGLSKQRRHSSEGGVKSSQVLGLQQHARLPKPKAKTLPEFIKD
GSSTYSKIPAHRKGHRPAKPQRQDLAPMPFSREVNLRCKGGP

>gi568815597f:27673308_27922326|GENSCAN_predicted_CDS_4|1929_bp
atggagccggtgccgctgcaggacttcgtgcgcgccttggaccccgcctccctcccgcgc
gtgctgcgggtctgctcgggggtctacttcgagggctccatctatgagatctctgggaat
gagtgctgcctctccacgggggacctgatcaaggtcacccaggtccgcctccagaaggtg
gtctgtgagaacccgaagaccagccagaccatggagctcgcccccaacttccagggctac
ttcacccccctcaacaccccacagagctatgaaaccctggaggagctggtctctgccaca
actcagagctccaagcagctgcccacttgcttcatgtcgacccacaggattgtcacagag
ggcagggtggtgactgaggaccagctcctcatgcttgaggctgtggtgatgcacctcggg
atccgctctgcccgctgtgtcctgggcatggagggtcagcaggtcatcctgcacctgccc
ctatcccagaaggggcccttctggacatgggagcctagtgcccctcgaactctgctccag
gtcctacaggatccagccctgaaagacctcgtcctcacctgccccaccctgccctggcat
tccctgatcctgcggccccagtatgagatccaagccatcatgcacatgcgcaggaccatt
gtcaagatcccttctaccctggaggtcgacgtggaggacgtcaccgcctcctcccggcac
gtccactttatcaaaccgctgctgctgagcgaggtcctggcctgggaaggccctttcccc
ctgtccatggagatcctggaggttcctgagggccgccccatcttcctcagcccgtgggtg
ggctccttgcaaaaaggccagaggctttgcgtctatggcctagcctcaccaccctggcgg
gtcctggcctcaagcaagggccgcaaggtgcccaggcacttcctggtgtcagggggctac
caaggcaagctgcggcggcggccaagggagttccccacggcctatgacctcctaggtgct
ttccagccaggccggccactccgggtggtggccacaaaggactgtgagggcgagagggag
gagaatcccgagttcacgtccctggctgtgggtgaccggctggaggtgctggggcctggc
caggcccatggggcccagggcagtgacgtggatgtcttggtttgtcagcggctgagtgac
caggctggggaggatgaggaggaagagtgcaaagaggaggcagagagcccagagcgggtc
ctgctgcccttccacttccctggcagtttcgtggaggagatgagtgacagccggcgctac
agcctggcagatctgactgcccagttttcactgccttgtgaggtcaaggtggtggccaag
gacaccagccaccccactgaccctctgacctccttcctgggcctgcggctggaggagaag
atcacagagccattcttggtggtgagcctagactctgagcctgggatgtgctttgagatc
cctccccggtggctggacctgactgttgtgaaggccaaggggcagccagacttgccagag
gggtctctccccatagccacagtggaggagctgacagacaccttctattatcgtcttcgg
aagttaccagcctgtgagatccaagcccccccacccaggccccctaaaaatcagggcctc
agcaagcagaggagacacagcagtgagggaggcgtcaagtcttctcaagtcttaggattg
cagcaacacgctcggctgcccaaacccaaggcgaagaccttgccagagttcatcaaggat
ggctccagtacgtacagcaagattcctgcccacaggaagggccacaggcccgctaagccc
caaaggcaggatctagctcccatgcccttctccagagaagtcaacctgcgctgcaaagga
ggaccctga

>gi568815597f:27673308_27922326|GENSCAN_predicted_peptide_5|374_aa
XAEGFALRKDGRGGFWSQLGAQYAFSGAWAEETGTSVASVSTDWLKRAPRELTGSAIRGK
AFVVPEKSSQSGAVAAAFCGFPLFPQTRTFSASLRRIVTKMWNSNDGGGFESYGSSSYGG
AGGYTQSPGGFGSPAPSQAEKKSRARAQHIVPCTISQLLSATLVDEVFRIGNVEISQVTI
VGIIRHAEKAPTNIVYKIDDMTAAPMDVRQWVDTDDTSSENTVVPPETYVKVAGHLRSFQ
NKKSLVAFKIMPLEDMNEFTTHILEVINAHMVLSKANSQPSAGRAPISNPGMSEAGNFGG
NSFMPANGLTVAQNQVLNLIKACPRPEGLNFQDLKNQLKHMSVSSIKQAVDFLSNEGHIY
STVDDDHFKSTDAE

>gi568815597f:27673308_27922326|GENSCAN_predicted_CDS_5|1125_bp
nntgcggagggttttgcccttcgtaaagatggccgcggaggcttttggagccaactggga
gcgcagtacgcgttttctggagcatgggcagaggagacaggaacaagcgtagcatccgtg
agcaccgattggctgaagcgagcaccccgggagctgactggctccgccattcgcgggaag
gcgtttgtggtgccagagaaaagtagccagagcggcgcagtggcggccgcgttctgtggt
tttccgctattcccccagacccgcaccttctcggcctctttgcggagaatcgtgaccaag
atgtggaacagtaatgatgggggtggattcgaaagctatggcagctcctcatacggggga
gccggcggctacacgcagtccccggggggctttggatcgcccgcaccttctcaagccgaa
aagaaatcaagagcccgagcccagcacattgtgccctgtactatatctcagctgctttct
gccactttggttgatgaagtgttcagaattgggaatgttgagatttcacaggtcactatt
gtggggatcatcagacatgcagagaaggctccaaccaacattgtttacaaaatagatgac
atgacagctgcacccatggacgttcgccagtgggttgacacagatgacaccagcagtgaa
aacactgtggttcctccagaaacatatgtgaaagtggcaggccacctgagatcttttcag
aacaaaaagagcctggtagcctttaagatcatgcccctggaggatatgaatgagttcacc
acacatattctggaagtgatcaatgcacacatggtactaagcaaagccaacagccagccc
tcagcagggagagcacctatcagcaatccaggaatgagtgaagcagggaactttggtggg
aatagcttcatgccagcaaatggcctcactgtggcccaaaaccaggtgttgaatttgatt
aaggcttgtccaagacctgaagggttgaactttcaggatctcaagaaccagctgaaacac
atgtctgtatcctcaatcaagcaagctgtggattttctgagcaatgaggggcacatctat
tctactgtggatgatgaccattttaaatccacagatgcagaataa