GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:52:45

Sequence gi568815578r:44994873_45198697 : 203825 bp : 40.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    218    385  168  1  0  103   90   200 0.757  20.92
 1.02 Intr +   2297   2434  138  1  0   65   80    89 0.974   5.54
 1.03 Intr +   5520   5648  129  2  0   80   70   160 0.995  13.37
 1.04 Intr +   6295   6481  187  0  1   87   78   276 0.862  24.84
 1.05 Intr +  12279  12359   81  1  0   79   94    18 0.244   0.19
 1.06 Intr +  30101  30258  158  0  2  118   78   155 0.468  16.31
 1.07 Term +  54006  54032   27  2  0  111   41    24 0.003  -2.90
 1.08 PlyA +  54436  54441    6                               1.05

 2.02 PlyA -  55456  55451    6                               1.05
 2.01 Sngl -  62351  61998  354  2  0   76   45   196 0.920   9.90
 2.00 Prom -  68851  68812   40                              -1.45

 3.00 Prom +  71260  71299   40                              -6.25
 3.01 Init +  75093  75149   57  2  0   64   45    52 0.115  -0.14
 3.02 Term +  80146  80304  159  0  0  108   53   331 0.993  28.56
 3.03 PlyA +  80657  80662    6                              -0.45

 4.05 PlyA -  81359  81354    6                              -3.64
 4.04 Term -  81773  81595  179  0  2   51   41   167 0.448   5.17
 4.03 Intr -  90679  90511  169  1  1   55   83   131 0.968   7.90
 4.02 Intr -  92303  92156  148  1  1   64   79    55 0.468   1.52
 4.01 Init -  94329  94250   80  0  2   72  113    40 0.840   5.48
 4.00 Prom -  96373  96334   40                              -9.85

 5.12 PlyA -  97455  97450    6                               1.05
 5.11 Term - 100468  99998  471  1  0   84   41   564 0.653  45.24
 5.10 Intr - 103823 102790 1034  0  2  103  102  1519 0.985 144.12
 5.09 Intr - 104367 104289   79  0  1   42  107    94 0.722   4.91
 5.08 Intr - 105362 105288   75  2  0   16   94   108 0.276   2.99
 5.07 Intr - 110272 110024  249  1  0    3   31   224 0.400   4.91
 5.06 Intr - 110406 110358   49  2  1   68   53    82 0.783   0.46
 5.05 Intr - 112438 112307  132  0  0   93   83    35 0.708   2.34
 5.04 Intr - 113367 113222  146  0  2  109   39    56 0.382   0.96
 5.03 Intr - 116761 116627  135  1  0   10    8   213 0.423   5.34
 5.02 Intr - 118625 118534   92  0  2   36   64    86 0.396  -0.21
 5.01 Init - 120211 120127   85  1  1   57  105   151 0.986  12.75
 5.00 Prom - 123273 123234   40                              -8.65

 6.04 PlyA - 123284 123279    6                               1.05
 6.03 Term - 124709 124600  110  0  2   51   43   126 0.834   2.09
 6.02 Intr - 129393 129235  159  1  0   86  116   146 0.999  16.24
 6.01 Init - 129575 129497   79  2  1  112   84   137 0.998  15.47
 6.00 Prom - 129628 129589   40                              -6.15

 7.07 PlyA - 130979 130974    6                               1.05
 7.06 Term - 135713 135648   66  2  0   82   34    41 0.184  -4.94
 7.05 Intr - 140575 140445  131  2  2   20   92   109 0.373   3.99
 7.04 Intr - 140904 140657  248  2  2   31   77   148 0.382   4.18
 7.03 Intr - 145103 144979  125  2  2   61   76    21 0.128  -3.34
 7.02 Intr - 145775 145478  298  1  1   23   72   211 0.206   8.95
 7.01 Init - 147385 147171  215  1  2   24   84   201 0.873  11.56
 7.00 Prom - 153016 152977   40                              -6.05

 8.00 Prom + 157926 157965   40                              -7.25
 8.01 Sngl + 160993 161358  366  0  0   89   54   178 0.378  10.34
 8.02 PlyA + 161743 161748    6                               1.05

 9.00 Prom + 173146 173185   40                              -3.65
 9.01 Init + 180051 180129   79  2  1   98  105   157 0.998  19.87
 9.02 Intr + 180989 181259  271  0  1   87   53   217 0.377  13.78
 9.03 Term + 181498 181654  157  1  1  137   43    24 0.280  -0.78
 9.04 PlyA + 181941 181946    6                               1.05

10.05 PlyA - 183030 183025    6                               1.05
10.04 Term - 186212 186163   50  0  2  103   49    25 0.124  -3.41
10.03 Intr - 187555 187346  210  2  0   82   32   144 0.219   6.16
10.02 Intr - 197609 197451  159  0  0   89  109    31 0.098   4.34
10.01 Init - 200727 200595  133  0  1   78   59    29 0.123  -0.65
10.00 Prom - 200851 200812   40                              -1.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_1|295_aa
DTMAKRNTVIGTPFWMAPEVIQEIGYNCVADIWSLGITAIEMAEGKPPYADIHPMRAIFM
IPTNPPPTFRKPELWSDNFTDFVKQCLVKSPEQRATATQLLQHPFVRSAKGVSILRDLIN
EAMDVKLKRQESQQREVDQDDEENSEEDEMDSGTMVRAVGDEMGTVRVASTMTDGANTMI
EHDDTLPSQLGTMVINAEDEEEEGTMKKMMFLHFLYKGTGSERLRLLLVHGHMARRDETM
QPAKPSFLEYFEQKEKENQINSFGKSVPGPLKNSSDWKIPQDGDYEFGLNIKDTH

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_1|888_bp
gataccatggccaagcggaatacagtgataggaacaccattttggatggctccagaagtg
attcaggaaattggatacaactgtgtagcagacatctggtccctgggaataactgccata
gaaatggctgaaggaaagcccccttatgctgatatccatccaatgagggcaatcttcatg
attcctacaaatcctcctcccacattccgaaaaccagagctatggtcagataactttaca
gattttgtgaaacagtgtcttgtaaagagccctgagcagagggccacagccactcagctc
ctgcagcacccatttgtcaggagtgccaaaggagtgtcaatactgcgagacttaattaat
gaagccatggatgtgaaactgaaacgccaggaatcccagcagcgggaagtggaccaggac
gatgaagaaaactcagaagaggatgaaatggattctggcacgatggttcgagcagtgggt
gatgagatgggcactgtccgagtagccagcaccatgactgatggagccaatactatgatt
gagcacgatgacacgttgccatcacaactgggcaccatggtgatcaatgcagaggatgag
gaagaggaaggaactatgaaaaagatgatgtttctccactttctctataaaggaacaggt
tcagagagattgagattgttgcttgttcatggtcacatggctagaagggatgagaccatg
cagcctgcgaaaccatcctttcttgaatattttgaacaaaaagaaaaggaaaaccagatc
aacagctttggcaagagtgtacctggtccactgaaaaattcttcagattggaaaatacca
caggatggagactacgagtttggtttaaatatcaaggatactcactaa

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_2|117_aa
MGDGNVGVRTIDTVAQRQPRTWSFCDLADSAYPNVLGAQLTWPLKPPHGSCVLKASWPSQ
GPHPATHRMASNPLDGLLSASTPEGKRRLAQRLWISSSLVKPANSSVIPWAEPHLLQ

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_2|354_bp
atgggagatggcaatgtgggtgtgaggaccatcgacactgtggcccagcgacagcctaga
acgtggtctttctgtgaccttgcagactcagcctatcctaacgttttgggagctcagctg
acatggccactaaagcctccacatggctcttgtgttctaaaggcctcctggcccagccag
ggcccacaccctgctacccaccggatggccagtaacccactggatggcctgctctctgcc
agcactccagaaggtaaacgacggcttgcccagagactttggatcagctccagtcttgtc
aaaccagcaaactcctctgtgattccatgggctgaaccacatcttctccaatga

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_3|71_aa
MVRKGLPASSEELTSKLKPLKSWTVEDLQKRLLALDPMMEQEIEEIRQKYQSKRQPILDA
IEAKKRRQQNF

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_3|216_bp
atggtcagaaaaggcttgcctgcatcctctgaggagttgacatctaaactaaaacctctt
aagagttggacagtggaggaccttcagaagaggctcttggccctggaccccatgatggag
caggagattgaagagatccggcagaagtaccagtccaagcggcagcccatcctggatgcc
atagaggctaagaagagacggcaacaaaacttctga

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_4|191_aa
MATDPFGEGKRNIHITLMQYSRVLPQSLRKYPTFPRGYGVYNSTLVPISVSVRNFWLQTT
ESNSGKLKPKKDCIGKAPLKEVALKCASYNEFLLSEGSGNSPEVLAGMVLNTRPHTSGQS
GPACVERILQEVSPKPASMGACLLYLVDVVELKDSLKLNDLARPGNLFTARSSTEEPRQE
QVSVSKVPWNN

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_4|576_bp
atggccacagatccttttggagaaggcaagagaaatattcacattacactcatgcaatat
agcagagtccttcctcaaagtcttaggaaatatcccacatttcctagaggatatggtgtc
tacaactcaactctagttccaatttctgtatcagttaggaatttttggttgcaaacaaca
gaatccaactctggaaaacttaagccaaaaaaggactgcattggaaaggcaccactgaag
gaagtggctttgaagtgtgcgtcctataatgagtttcttctatctgaaggctctggaaac
agccctgaggtgctggctgggatggtcctcaataccagacctcatacttcaggccagtcc
ggccctgcctgtgtggaaagaattctccaggaagttagcccgaaacctgcctccatgggt
gcttgtctgctatatctggtagatgttgtggagctcaaggactcgctgaaactgaatgat
ctggctcgtccagggaatctctttacagccaggtcatctactgaagagccaaggcaagag
caagtttctgttagcaaggttccctggaacaattaa

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_5|848_aa
MRTQSLLLLGALLAVGSQLPAVFGRKKGGSTPQYSAQLLRGKELAQQKPSLRRATLHGQR
NASETELTPDPRNHAICNSDERNGAEDVASCGCDSAPGWPLLTHEGGGKFPALSAGPLVS
STRLRDWAVHCGGTLGNPWSSFPVLLTLQYDDRASALCHEKSLPGATCLAGETPGASLKL
SQTSQASNLPKRKKYLWKNGYRSTEVHQEEKKPVLQVLPVRSHSRLWQPQESPRYQLSGG
KTYQEPVCTKAVKDPRGAVGTNESHKNGGGGSDKEPGSSGLFMSSCTYPGLPKKKKYDAT
RFHRQGNKPREGDECTQGQQLVMLMLLVRGTHYENLRSKVVLPTPLGGRSTETFVSEFPG
PDTGIRWRRSDEALRVNVGGVRRQLSARALARFPGTRLGRLQAAASEEQARRLCDDYDEA
AREFYFDRHPGFFLSLLHFYRTGHLHVLDELCVFAFGQEADYWGLGENALAACCRARYLE
RRLTQPHAWDEDSDTPSSVDPCPDEISDVQRELARYGAARCGRLRRRLWLTMENPGYSLP
SKLFSCVSISVVLASIAAMCIHSLPEYQAREAAAAVAAVAAGRSPEGVRDDPVLRRLEYF
CIAWFSFEVSSRLLLAPSTRNFFCHPLNLIDIVSVLPFYLTLLAGVALGDQGGKEFGHLG
KVVQVFRLMRIFRVLKLARHSTGLRSLGATLKHSYREVGILLLYLAVGVSVFSGVAYTAE
KEEDVGFNTIPACWWWGTVSMTTVGYGDVVPVTVAGKLAASGCILGGILVVALPITIIFN
KFSHFYRRQKALEAAVRNSNHQEFEDLLSSIDGVSEASLETSRETSQEGQSADLESQAPS
EPPHPQMY

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_5|2547_bp
atgaggacccagagccttctcctcctgggggccctcctggctgtggggagtcagctgcct
gctgtctttggcaggaagaagggagggagcacaccccaatattcagcccagctgctcaga
ggcaaagagctggcacagcagaaaccgagcctgcgcagggccactctgcatgggcagagg
aacgcctcagagacagagttaacgcctgatcctcgaaaccatgccatttgcaacagtgat
gaacgcaacggtgcagaagacgtggccagctgtggctgtgacagtgcaccaggctggcca
ctgctgacgcacgaaggaggtggcaaattccctgctctatctgctgggcctcttgtgtcc
agcacaagactgagggactgggcggtccactgtggaggtacacttgggaatccttggtct
tctttccctgttctcttgaccttgcagtatgatgacagagcttctgccctctgccatgag
aagagcttgcccggagccacttgtcttgctggagagacacctggggcaagcctgaagctg
agccagaccagccaagcctcaaacttgccaaagagaaaaaaatatttgtggaaaaatggc
tataggagtacagaagttcaccaggaagagaagaagccagtacttcaggttctccctgtg
cggagccatagcaggctctggcagcctcaggaaagccccagataccagctgtcaggagga
aaaacataccaggagcctgtgtgcacaaaagcggtaaaggatcccagaggagctgtgggc
acgaatgagtcacataagaatggtggaggaggaagtgacaaggagcctgggtcttcaggg
ctattcatgagcagctgtacctaccctggcctgccaaaaaaaaaaaaatatgatgctacc
cggttccacagacagggaaacaagcccagagagggcgacgaatgcactcaaggacagcag
ctagtgatgctgatgctgctggtccggggaacacactatgagaacctccggtctaaagtg
gtgctgccaacacccctaggagggaggagcactgaaacctttgtgagcgagttcccgggc
cccgacaccgggatccgctggcggcgaagcgacgaggcgctgcgcgtgaacgtgggtggc
gtgcggcggcagctgagcgcgcgcgccctggcgcgcttcccgggcacgcggctgggccgc
ctgcaggccgcggcgtcggaggagcaggcgcggcgcctgtgcgacgactacgacgaggcg
gcgcgcgaattctacttcgaccggcacccgggcttcttcctgagcctgctgcacttctac
cgcactggccacctgcacgtgctcgacgagctgtgcgtcttcgcctttggccaggaggcc
gactactggggcctaggcgagaacgcgcttgccgcgtgctgccgcgcgcgctacctggag
aggcggctgacccagccgcacgcctgggacgaggacagcgacacgccgagcagcgtggac
ccgtgccccgacgagatctccgacgtgcagcgagaactggcgcgctatggcgcggcgcgc
tgtggccgcctgcgccgccgcctctggctgaccatggagaacccgggctactcgctgccg
agcaagctcttcagctgcgtctccatcagcgtggtgctcgcctccatcgccgccatgtgc
atccacagcctgcccgagtaccaggcccgcgaggcggcggccgccgtggctgcggtggcc
gcgggccgcagcccggaaggcgtgcgcgacgacccggtgctgcgacgcctcgagtacttc
tgcatcgcctggttcagcttcgaggtgtcgtcgcgcctcctgctggcgcccagtacgcgc
aacttcttctgccacccgctcaacctcatcgacattgtgtctgtgctgcccttctatctc
acgctgctggctggtgtggcactgggcgaccagggcggcaaggagttcggccacctgggc
aaggtggtgcaggtgttccgcctcatgcgcatcttccgcgtactcaagttggcgcgccat
tccaccgggctgcgctcgctgggagccacgctcaagcacagctaccgtgaggtgggcatc
ttgctgctgtacctggctgtgggtgtgtcagtgttctctggtgtggcctacacagctgaa
aaggaggaggacgtgggctttaacaccatcccagcctgctggtggtggggcacagtgagc
atgaccaccgtgggctatggggatgtggtgccagtgacggtggctggcaagctggcagcc
tcaggctgcatcctagggggcatcctggtggtagcactccccatcaccatcatcttcaac
aagttctcccacttctaccggcgccagaaggctctggaggcagccgtgcgcaacagcaac
caccaagagtttgaggacttgctgagcagcattgatggggtgtcggaggcatctctggag
acatcccgagaaacctctcaggagggacagtctgcagatctagagagccaggcccccagt
gagcctccacaccctcagatgtattaa

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_6|115_aa
MGSSSFLVLMVSLVLVTLVAVEGVKEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERK
CCYLHCGFKCVIPVKELEEETTTLIIQQPQHQGKTLRQQKDDDDSLKAYMIISIF

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_6|348_bp
atggggtccagcagcttcttggtcctcatggtgtctctcgttcttgtgaccctggtggct
gtggaaggagttaaagagggtatagagaaagcaggggtttgcccagctgacaacgtacgc
tgcttcaagtccgatcctccccagtgtcacacagaccaggactgtctgggggaaaggaag
tgttgttacctgcactgtggcttcaagtgtgtgattcctgtgaaggaactggaagaagaa
accactaccctgatcattcagcagcctcaacatcaaggtaagaccctccgccagcagaaa
gatgatgatgactcactgaaggcttacatgattattagcatattttag

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_7|360_aa
MWRTKEHNEAGWLLLSSVDEVMKENDELRDSISQLQKQILSLKSAKIALTESLISFRERA
EIVEKQTQALIMGTVYWGKGNDQTFQGLLDTGSELTLIPGDPKHHCGPPGPKVRAYGDQV
INGVLAQVQLIVGPVGPWTHPVVISPVPECVIGIAILNSWQNPHIGSLTGRAHQKQFAFS
WQGQQYTFTVLPQRYINFPALCHKTAKRHTARRKLASICNARIQTSPCRPRNYNCHNMRP
NPDSRPDSVFSESRRQRVQDCPNRPRIQAHPSRPKYQASSPCTEDPVPSLKTHAQGSSQY
LASPFRLRLKGHPKSKPAPMDPGFRLAPVNTDSRPNPTQEAFAPLSGSVEGGEGVQMIIS

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_7|1083_bp
atgtggagaaccaaggaacataatgaagctggttggttgctcctaagttcagtggacgaa
gtgatgaaagaaaatgatgaactcagggattctatctcccagcttcagaagcagatactg
agtctcaaatctgctaagattgccctgactgagagtcttatctcctttagagaaagagct
gaaattgtggaaaaacagacacaagctcttatcatgggaactgtgtactggggaaaggga
aatgatcagacatttcagggactactggacactggctctgagctgacgttgattccaggg
gacccaaaacatcattgtggtcctccaggaccaaaagtaagggcttatggagatcaggta
attaacggagttttggctcaggtccaacttatagtgggtccagtgggcccctggactcat
cctgtggtcatttccccagtgccagaatgcgtaattggcatagccatacttaacagctgg
cagaacccccacattggctccctgactggtagggcccaccagaagcaatttgccttcagc
tggcaaggccagcaatatacctttactgtcctacctcaaagatatatcaactttccagct
ttgtgtcataaaacagcaaaaagacacacagctaggaggaaactggccagcatctgcaat
gccaggatccagacaagcccctgcagacccaggaactacaactgccacaacatgaggcca
aacccagactctagaccagactctgtgttctcagagtccagaagacaaagggtccaggat
tgccctaacagacccaggatccaggcccatcccagtagacccaaataccaggccagttcc
ccatgcactgaggatccagtaccatccctgaagacccatgctcaaggctcatctcagtac
ttggccagccctttcagactcaggctcaagggccaccccaagagcaagccagcccccatg
gacccaggcttcaggctagctcctgtgaatacagattccaggcccaaccccacccaggaa
gcatttgcacctttgtctggctcagtggaaggtggagaaggggttcagatgatcatttcc
tga

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_8|121_aa
MAKRGQDTAQAMASEGASSKPGELPCGVEPVGAQNSRIEIWEPLPRFQRMYGNVWMSRQK
FASGAGPSWRTSARAVWKGNMRSESPHRVPTGSLPNGAVRRGPPSCRPQNGRSTDSLQCA
P

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_8|366_bp
atggctaaaaggggccaagatacagctcaggccatggcttcagagggtgcaagctccaag
cctggggagcttccttgtggtgttgagcctgtgggtgcacagaactcaagaattgagatt
tgggaacctttgcctagatttcagaggatgtatggaaatgtctggatgtccaggcagaag
tttgcttcaggagcagggccatcatggagaacctctgctagggcagtgtggaagggaaat
atgagatcagagtccccacacagagtccccactgggtcactgcctaatggagctgtgaga
agagggccaccatcctgcagaccccagaatggtagatccactgacagcttgcaatgtgca
ccttga

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_9|168_aa
MRASSFLIVVVFLIAGTLVLEAAVTGVPVKGQDTVKGRVPFNGQDPVKGQVSVKGQDKVK
AQEPVKGPVSTKPGSCPIILIRCAMLNPPNRCLKDTDCPGIKKCCEGSCGMACFVPQGSR
SLLHLCRPQSYRPHLVLSPCCPSPSHTVHSSSHSGCPRLELPLSSTFQ

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_9|507_bp
atgagggccagcagcttcttgatcgtggtggtgttcctcatcgctgggacgctggttcta
gaggcagctgtcacgggagttcctgttaaaggtcaagacactgtcaaaggccgtgttcca
ttcaatggacaagatcccgttaaaggacaagtttcagttaaaggtcaagataaagtcaaa
gcgcaagagccagtcaaaggtccagtctccactaagcctggctcctgccccattatcttg
atccggtgcgccatgttgaatccccctaaccgctgcttgaaagatactgactgcccagga
atcaagaagtgctgtgaaggctcttgcgggatggcctgtttcgttccccaagggagccgg
tccttgctgcacctgtgccgtccccagagctacaggccccatctggtcctaagtccctgc
tgcccttccccttcccacactgtccattcttcctcccattcaggatgcccacggctggag
ctgcctctctcatccactttccaataa

>gi568815578r:44994873_45198697|GENSCAN_predicted_peptide_10|183_aa
MECYAAIKNDEFMSFVGTWMKLEIIILSKLSQEQKTKHRIFSLRAQYEGSFTQGTSSQVM
GESLSSHFPLKGYDISPEAKGCQHLSNPFQGCIAEIPGCQAQSNGPPFMAQIKWIHQSDL
IRKLALHLSSRPSKATEHSVTERPSFSSLGLSFLLLNVRRWFVIMSRDGVTEVSENRELI
QVC

>gi568815578r:44994873_45198697|GENSCAN_predicted_CDS_10|552_bp
atggaatgctatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg
aaattggaaatcatcattctcagtaaactatcacaagaacaaaaaaccaaacaccgcata
ttctcactcagagctcaatatgagggaagcttcactcagggcacatctagccaagtaatg
ggagagtctctttcctctcacttccctttaaagggttatgatatctcaccagaggcaaaa
ggttgccagcatctctcaaacccattccaaggttgtattgcagagattccaggctgtcag
gctcaatccaatggaccacctttcatggcccagatcaagtggatccatcaaagtgacctt
attaggaagttagcgctacatcttagttctaggcccagcaaggctacggagcactctgtg
actgaaagaccttcattctcctctctgggactcagttttctcctccttaatgtgagaagg
tggttcgtgataatgtcaagggatggtgttactgaggttagtgagaacagggaacttatc
caggtctgttag