GENSCAN 1.0 Date run: 6-Nov-116 Time: 03:41:59

Sequence gi568815590r:42756162_42987864 : 231703 bp : 42.57% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -    663     22  642  2  0   72   94   348 0.644  24.01
 1.05 Intr -    876    767  110  0  2   60   27   184 0.540   7.66
 1.04 Intr -   3243   3200   44  1  2   40   78    66 0.052  -2.16
 1.03 Intr -   5673   5139  535  0  1    2   10   357 0.024  10.87
 1.02 Intr -   9043   8904  140  2  2  122   89   135 0.954  16.26
 1.01 Init -  12269  12191   79  2  1   72  100    59 0.334   4.78
 1.00 Prom -  14845  14806   40                              -6.05

 2.00 Prom +  21366  21405   40                              -5.95
 2.01 Init +  31713  31806   94  2  1   78   33   125 0.057   6.59
 2.02 Intr +  39442  39516   75  2  0   90   75    59 0.050   3.37
 2.03 Intr +  40504  40594   91  2  1  137   80     6 0.024   2.83
 2.04 Term +  58450  58909  460  1  1   84   42   346 0.343  23.18
 2.05 PlyA +  60124  60129    6                               1.05

 3.16 PlyA -  60657  60652    6                               1.05
 3.15 Term -  82175  81801  375  2  0   81   42   423 0.915  30.75
 3.14 Intr -  87034  86863  172  0  1   22   61   258 0.876  15.52
 3.13 Intr -  87511  87423   89  1  2   52   68    54 0.870  -2.35
 3.12 Intr -  88003  87819  185  2  2   84   14    92 0.630  -0.01
 3.11 Intr -  90823  90715  109  1  1   53   58    92 0.614   1.64
 3.10 Intr -  94245  94117  129  0  0   64   98   105 0.835   9.07
 3.09 Intr - 105694 105584  111  1  0   84   84    79 0.529   6.76
 3.08 Intr - 113951 113843  109  1  1   68   91    93 0.421   6.97
 3.07 Intr - 131710 131567  144  0  0  120   94   181 0.654  20.38
 3.06 Intr - 140158 140005  154  2  1   25   -5   186 0.005   1.21
 3.05 Intr - 140988 140459  530  2  2   47   13   326 0.074  12.65
 3.04 Intr - 141417 141079  339  2  0  104   94    82 0.171   4.06
 3.03 Intr - 142319 142218  102  1  0   77   98    26 0.128   0.87
 3.02 Intr - 146004 145891  114  2  0   29   91   105 0.097   3.54
 3.01 Init - 174581 174535   47  2  2   72  115    36 0.355   4.91
 3.00 Prom - 178717 178678   40                              -3.65

 4.00 Prom + 179010 179049   40                              -5.65
 4.01 Init + 182751 182876  126  2  0   68   37    62 0.300  -0.69
 4.02 Intr + 187152 187284  133  2  1   86   86   114 0.913  10.30
 4.03 Intr + 194227 194294   68  2  2   83   87    67 0.686   3.81
 4.04 Intr + 200933 200995   63  1  0   63   76    89 0.862   3.20
 4.05 Intr + 203070 203153   84  2  0   84  110   107 0.997  11.60
 4.06 Intr + 208150 208313  164  0  2   77   98    92 0.998   6.95
 4.07 Intr + 210312 210452  141  0  0   42   83   284 0.999  21.85
 4.08 Intr + 211852 212053  202  1  1   71   98   173 0.999  14.97
 4.09 Intr + 217128 217238  111  2  0   83   86   135 0.999  12.46
 4.10 Intr + 217946 218033   88  1  1   94   93    83 0.979   8.02
 4.11 Intr + 226466 226535   70  0  1   80  110    58 0.870   4.52
 4.12 Intr + 230494 230634  141  1  0   69   48   178 0.262  10.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  66998  66912   87  2  0   98   40    70 0.807  -0.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:42756162_42987864|GENSCAN_predicted_peptide_1|517_aa
MLTSKGQGFLHGGLCLWLCVFTPFFKGCVGCATEERLFHKLFSHYNQFIRPVENVSDPVT
VHFEVAITQLANVPLMQLLCQLIDFSKKEVQGDGTRNGVDAVGLWTPPAEMSDSSAMPIG
GIWNWMEREYDLPDRSCSIGSADISEGVHIPMGHVHTTWSFSEPAEVRLWSWLLPTDLSL
LNTVCLFRDKAGFEGQLGLMPSPPRPSWNRDCRASPFLTNSEWIYGHGPCWTLQIARGRS
LPEGMTAGLCNQTFMGCRDRKHFTEEIWNDYKLRWDPMEYDGIETLRVPADKIWKPDIVL
YNNAVGDFQVEGKTKALLKYNGMITWTPPAIFKSSCPMDITFFPFDHQNCSLKFGSWTYD
KAEIDLLIIGSKVDMNDFWENSEWEIIDASGYKHDIKYNCCEEIYTDITYSFYIRRLPMF
YTINLIIPCLFISFLTVLVFYLPSDCGEKVTLCISVLLSLTVFLLVITETIPSTSLVVPL
VGEYLLFTMIFVTLSIVVTVFVLNIHYRTPTTHTMPS

>gi568815590r:42756162_42987864|GENSCAN_predicted_CDS_1|1551_bp
atgctgaccagcaaggggcagggattccttcatgggggcttgtgtctctggctgtgtgtg
ttcacacctttctttaaaggctgtgtgggctgtgcaactgaggagaggctcttccacaaa
ctgttttctcattacaaccagttcatcaggcctgtggaaaacgtttccgaccctgtcacg
gtacactttgaagtggccatcacccagctggccaacgtgcctttgatgcagctgctgtgt
caattaattgatttttccaagaaggaggtacagggtgatggcaccaggaatggagttgat
gctgtgggtctttggactcctccggcagagatgtctgatagcagtgccatgcccattggt
gggatctggaattggatggagagagagtatgacttgccagacaggtcctgctccataggg
tcagctgacatctctgaaggtgtccacatccccatgggccatgtccataccacctggagc
ttctccgagcctgcggaagtgaggctgtggagttggctattgccaactgatctcagcctt
ctgaacacagtgtgcttattcagagacaaagcaggctttgaagggcagttagggcttatg
ccctcacccccaaggccttcctggaaccgagactgcagggctagtccattcctgaccaac
agtgagtggatttatggacatggcccctgctggactctgcagattgcaaggggacgctca
ttaccagaaggcatgaccgcaggcttgtgtaatcaaacctttatgggctgcagagatcgg
aagcacttcacagaagagatctggaatgattataaattgcgctgggatccaatggaatat
gatggcattgagactcttcgcgttcctgcagataagatttggaagcccgacattgttctc
tataacaatgctgttggtgacttccaagtagaaggcaaaacaaaagctcttcttaaatac
aatggcatgataacctggactccaccagctatttttaagagttcctgccctatggatatc
acctttttcccttttgatcatcaaaactgttccctaaaatttggttcctggacgtatgac
aaagctgaaattgatcttctaatcattggatcaaaagtggatatgaatgatttttgggaa
aacagtgaatgggaaatcattgatgcctctggctacaaacatgacatcaaatacaactgt
tgtgaagagatatacacagatataacctattctttctacattagaagattgccgatgttt
tacacgattaatctgatcatcccttgtctctttatttcatttctaaccgtgttggtcttt
taccttccttcggactgtggtgaaaaagtgacgctttgtatttcagtcctgctttctctg
actgtgtttttgctggtcatcacagaaaccatcccatccacatctctggtggtcccactg
gtgggtgagtacctgctgttcaccatgatctttgtcacactgtccatcgtggtgactgtg
tttgtgttgaacatacactaccgcaccccaaccacgcacacaatgcccagn

>gi568815590r:42756162_42987864|GENSCAN_predicted_peptide_2|239_aa
MSEHQKEQTAGTPSVRTVTLTVRVRGFILKVRLAEVGASSFCLQRGVERHRQERDQRGRR
LGRPRTQSSRPAPLAPGTELLSTRASSEDHSPSQPLQGDPSLSSSPVKAARALEFTLALA
GCDAARAHRSKEEEDEAGLSPPALGGLSLTRPGLRFFRLRFPGEALGAFPAYRRHSVPQE
VLILLGLQGEGVHSSPETAKVMKNKVWCEDRTQCSSQNRKRTLSGTAVTSEYSWEFSSE

>gi568815590r:42756162_42987864|GENSCAN_predicted_CDS_2|720_bp
atgtccgaacatcagaaggaacaaactgctggcacaccatctgtaagaactgtaacactc
accgtgagggtccgcggcttcattcttaaagtcaggctggctgaggtcggagccagctcc
ttctgcttacagagaggtgtggagaggcacaggcaggaacgggaccagcgtgggcgtcgg
ctcggcaggccccgcactcagagcagccggccagcaccactggccccaggcactgagctg
cttagcacccgggccagcagcgaggaccactccccatcacagccactgcagggtgacccc
tcactcagctccagcccggtcaaggctgcccgtgcacttgagttcacactcgcgctggct
gggtgtgatgccgcccgagcacaccgctctaaggaggaggaagatgaagcgggactttca
ccccctgccctgggtggtctctctctcaccaggcctgggctccgtttcttcagactgcgc
ttccccggagaagcccttggggctttccctgcctatagacggcactcggtccctcaagaa
gttctgatactgttgggtctccaaggggaaggagtccatagttctcctgaaactgccaag
gtcatgaaaaacaaagtctggtgtgaggacagaacgcaatgcagctcccagaacagaaaa
aggacgctaagtggaacagcggtgacctcagaatacagttgggaatttagttcagagtaa

>gi568815590r:42756162_42987864|GENSCAN_predicted_peptide_3|902_aa
MGKISATGINMGTKCSWALVWHLESYDPKHYEREGMQDWKTASGQSEEATQQSSQKPQPH
YTTYQSSSFLKYSSESHLLAWRENSSEGSFQFPGRSRARPPRTRQQRRGAAAGPGRGAVR
LGHPQSAAQPQLRAAARIPESPAAFPAQPRPGSARNSDASGPASLSRTLGRASSPRPPQA
PDVTAPSPAALAPRAARGGSRDSTLNIFPAAARQRLPPGQTPPPLCRGIRGPAQLTRPQA
RLLLGSGRRLSPSAGSGSAPSPPAPLLLLSLGTARQPRATSDTPAAPRPRPRPPTPLLPS
AAALAGAEAEEPLRTLAPRPTRAAAPPPPPPPPPLPPGAPPPPVRCVSRRARAPPWRLRR
RVLLRRPVAPSRKLGSAQSPRGCPESPATVEGPTRRAALGGFGGGEDTAAPSRVARSVTL
RTLLNAGGVPGMAKYQGEVQSLKLDDDSVIEGVSDQVLVAVVVSFALIATLVYALFRMHL
LPLDSSSTLTCTVPSACTKPPSRWRPTVDIFFVVTLLLTVFGEDDQSQDVLRLHQDINDY
NRRFSGQPRSDHSDLPGSLPGTKCLAPYMQRSPMIISSVTLLESLLSLSHRVKNLPQPPQ
PSATMISQQPSTSRQDSPSTKRSPFAEGSGVKIAAAVRYLQHSTWKSQQWRAFRKASGEG
QPGGSTPVIPTHGEVEMGGSLELEFQTDLGNKWAQSANDLGFTPVLQEALVVPYCKLPQK
SYKRSEGIQPSAATLRTASPAFCGGRSEVPEDRKDGAVLLRLRLQEPLRQGQARFFPQKE
DLLEPQEQLPPPPLPPPVSQVDAAIGLLMPPLQTPVNLSVFCDHNYTVEDTMHQRKRIHQ
LEQQVEKLRKKLKTAQQRCRRQERQLEKLKEVVHFQKEKDDVSERGYVILPNDYFEIVEV
PA

>gi568815590r:42756162_42987864|GENSCAN_predicted_CDS_3|2709_bp
atggggaagatcagtgctacgggtataaacatgggtactaaatgcagctgggcactggtt
tggcatttggaatcctatgatccaaagcactatgaaagggaagggatgcaagattggaaa
acagcctctggccaatcagaagaagcaactcaacaaagcagtcagaaacctcagccacat
tatacaacttaccagtcttcctcttttctgaaatacagcagtgaaagccatctccttgcc
tggagggagaactcatcagaagggtcctttcagttcccgggccggagccgggcccgtcct
ccgcgaacgcggcaacaaaggcgaggagcggcggcgggtcctggacgcggggcagtcagg
ctcggccacccccagagcgcagcgcagccccagctgcgagcagccgcacggatcccagag
tcgcccgccgcgttcccagcccagccccggccggggtccgcccggaacagcgacgcctca
ggcccggcaagtctctctcggaccctgggccgagcctcctcgccccgcccgccccaggcc
ccggatgtgacggcaccgtcgcccgcagcgctcgccccacgggccgcgcgtggagggtcc
cgcgactctacgctgaacatcttccccgccgcggccaggcagcgcctaccgcctggccag
acaccgccgccgctctgtcgggggatccgggggcccgcgcagctgactcggcctcaggcc
cgcctgctcctcggatccggccgccgactctcgcccagcgctggcagcggctccgcaccg
tcacccccagcgcccctccttctcctttcactcgggaccgcgcgccagccgcgcgcgact
tcggacacgcccgccgccccgcgtccccgcccccggccgcccacaccgctcctccccagc
gccgccgccctggctggagccgaggccgaggagccactgcgcacgctcgcgccccggccg
acccgcgccgccgcgccgccgccgccgccgccgccgccgccgctgccgccgggcgcgccc
ccgccaccagttcgctgcgtgtcgaggcgagcacgcgctccgccctggaggctgcggcga
cgggtcctcctccgccgtccggtcgcgccctcgcggaagctcggcagtgcgcaatcgccc
cgcggctgtccggagtcgccggcgacggtggaggggccgacgcggagagcggctctaggt
gggtttggcggcggcgaggacaccgccgctccctctagggtcgctcggagcgtgaccctg
agaactctccttaatgcaggaggggtacctggaatggccaaatatcaaggtgaagttcaa
agtttgaaactggatgatgattcagttatagaaggagtaagcgaccaagtacttgtggca
gttgtggtcagtttcgctttgattgctaccctggtatatgcacttttcaggatgcacctg
ctgccactcgacagcagttctacactgacatgtactgtcccatctgcctgcaccaagcct
ccttcccggtggagaccaactgtggacatcttttttgtggtaaccttactcctaacagta
tttggtgaagatgatcagtctcaggatgttctgagattgcatcaggatattaatgattat
aaccggagattctcagggcaacccagatctgatcacagcgaccttccaggcagtctccca
ggcaccaagtgcctggctccctacatgcaacgcagcccgatgatcatctcaagtgtgact
ctgctggaaagcctgctctccctttcccacagagtaaagaacttgccacagccacctcaa
ccttcagcaaccatgatcagtcagcagccatcaacatcaaggcaagactctccatcaaca
aaaagatcaccatttgctgaaggctcaggtgtgaaaattgctgcagcagttcgctatttg
caacacagcacatggaaatcacagcaatggagagccttcagaaaagcttctggggaggga
cagcctggtggctcaactcctgtaatcccaacccatggggaggtcgagatgggaggatcc
cttgagctcgagttccagactgacctgggcaacaaatgggcgcaatcagccaacgacctc
ggctttacacctgttctccaagaggccctagttgttccctactgtaagctgccccagaaa
agctacaaaagaagcgagggaatccaaccgagcgcagcgacactgagaacagcttcccct
gccttctgcggcggcagaagtgaagtgcctgaggaccggaaggatggtgcagtcctgctc
cgcctacggctgcaagaaccgctacgacaaggacaagcccgtttctttccacaaaaagaa
gatcttctggagccacaggaacagcttcccccacctcctttaccgcctcctgtttcccag
gttgatgctgctattggattactaatgccgcctcttcagacccctgttaatctctcagtt
ttctgtgaccacaactatactgtggaggatacaatgcaccagcggaaaaggattcatcag
ctagaacagcaagttgaaaaactcagaaagaagctcaagaccgcacagcagcgatgcaga
aggcaagaacggcagcttgaaaaattaaaggaggttgttcacttccagaaagagaaagac
gacgtatcagaaagaggttatgtgattctaccaaatgactactttgaaatagttgaagta
ccagcataa

>gi568815590r:42756162_42987864|GENSCAN_predicted_peptide_4|464_aa
MTLNEHAAFKHLFNKAHLAPPLIHLTLSGYSTCFREHRVGGKILGQQINDFTLPDVNLIG
EHSDAAELGRMLQLILGCAVNCEQKQEYIQAIMMMEESVQHVVMTAIQELMSKESPVSAG
NDAYVDLDRQLKKTTEELNEALSAKEEIAQRCHELDMQVAALQEEKSSLLAENQVLMERL
NQSDSIEDPNSPAGRRHLQLQTQLEQLQEETFRLEAAKDDYRIRCEELEKEISELRQQND
ELTTLADEAQSLKDEIDVLRHSSDKVSKLEGQVESYKKKLEDLGDLRRQVKLLEEKNTMY
MQNTVSLEEELRKANAARSQLETYKRQVVELQNRLSEESKKADKLDFEYKRLKEKVDSLQ
KEKDRLRTERDSLKETIEELRCVQAQEGQLTTQGLMPLGSQESSDSLAAEIVTPEIREKL
IRLQHENKMLKLNQEGSDNEKIALLQSLLDDANLRKNELETENS

>gi568815590r:42756162_42987864|GENSCAN_predicted_CDS_4|1392_bp
atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg
cccttaatccatttaaccctgagtggatacagcacatgtttcagagagcacagggttggg
ggtaagattttaggacagcaaattaatgactttacccttcctgatgtgaaccttattggg
gagcattctgatgcagcagagcttggaaggatgcttcagctcatcttaggctgtgctgtg
aactgtgaacagaagcaagagtacatccaagccattatgatgatggaggaatctgttcaa
catgttgtcatgacagccattcaagagctgatgagtaaagaatctcctgtctctgctgga
aatgatgcctatgttgaccttgatcgtcagctgaagaaaactacagaggaactaaatgaa
gctttgtcagcaaaggaagaaattgctcaaagatgccatgaactggatatgcaggttgca
gcattgcaggaagagaaaagtagtttgttggcagagaatcaggtattaatggaaagactc
aatcaatctgattctatagaagaccctaacagtccagcaggaagaaggcatttgcagctc
cagactcaattagaacagctccaagaagaaacattcagactagaagcagccaaagatgat
tatcgaatacgttgtgaagagttagaaaaggagatctctgaacttcggcaacagaatgat
gaactgaccactttggcagatgaagctcagtctctgaaagatgagatcgacgtgctgaga
cattcttctgataaagtatctaaactagaaggtcaagtagaatcttataaaaagaagcta
gaagaccttggtgatttaaggcggcaggttaaactcttagaagagaagaataccatgtat
atgcagaatactgtcagtctagaggaagagttaagaaaggccaacgcagcgcgaagtcaa
cttgaaacctacaagagacaggtagtagaactacaaaacagattatccgaagaatcaaag
aaagcagataaactagattttgaatataagcggctaaaagaaaaagttgacagtcttcaa
aaagaaaaggacaggctgagaacagaaagggattctctgaaggaaaccattgaagagctt
cgttgtgtacaagctcaagaagggcagctcacaacacaagggttaatgcctcttggaagt
caggagtcttcagacagtctagctgcagagattgttacacctgaaatcagggagaaactt
attcgtcttcagcatgagaataagatgttaaagcttaaccaagaaggttcggacaatgaa
aaaatagccttattgcagagccttctagatgatgcaaatctacgcaagaatgaactggag
acagagaatagn