GENSCAN 1.0 Date run: 5-Nov-116 Time: 03:17:44 Sequence gi568815591f:37640394_37841308 : 200915 bp : 38.07% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 14976 15167 192 2 0 42 44 228 0.825 8.69 1.02 PlyA + 16990 16995 6 1.05 2.04 PlyA - 17759 17754 6 1.05 2.03 Term - 25488 25148 341 1 2 98 40 165 0.788 6.41 2.02 Intr - 28560 28432 129 1 0 49 78 99 0.282 4.75 2.01 Init - 38643 38628 16 0 1 57 99 7 0.131 -0.82 2.00 Prom - 52088 52049 40 -3.65 3.00 Prom + 64075 64114 40 -5.45 3.01 Init + 69327 69406 80 2 2 75 115 42 0.904 6.18 3.02 Intr + 78577 78856 280 1 1 79 30 168 0.107 6.66 3.03 Term + 83726 83854 129 1 0 106 54 97 0.356 5.30 3.04 PlyA + 83887 83892 6 -0.45 4.03 PlyA - 84097 84092 6 1.05 4.02 Term - 85087 84919 169 0 1 23 47 114 0.062 -2.83 4.01 Init - 90275 90142 134 2 2 100 110 101 0.535 13.16 4.00 Prom - 95036 94997 40 -5.75 5.00 Prom + 98367 98406 40 -6.85 5.01 Init + 100157 100621 465 1 0 118 16 371 0.150 28.59 5.02 Intr + 110020 110409 390 0 0 32 87 273 0.008 15.59 5.03 Intr + 123130 123304 175 0 1 29 31 116 0.000 -1.41 5.04 Intr + 131502 131598 97 1 1 87 78 39 0.000 0.95 5.05 Intr + 135770 135902 133 2 1 47 55 91 0.001 1.43 5.06 Intr + 142364 142471 108 1 0 86 93 2 0.006 0.06 5.07 Term + 156093 156272 180 2 0 107 44 164 0.970 10.53 5.08 PlyA + 157536 157541 6 -0.45 6.00 Prom + 161697 161736 40 -5.55 6.01 Init + 164072 164081 10 1 1 85 86 18 0.224 0.97 6.02 Intr + 165008 165187 180 0 0 79 36 185 0.414 11.32 6.03 Intr + 167943 168110 168 1 0 13 86 128 0.127 4.10 6.04 Intr + 179352 179624 273 1 0 52 36 163 0.100 4.09 6.05 Intr + 186574 186652 79 2 1 83 94 54 0.235 3.19 6.06 Intr + 188862 189079 218 0 2 84 59 129 0.185 6.82 6.07 Term + 193243 193403 161 2 2 34 38 141 0.145 0.82 6.08 PlyA + 193415 193420 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 179352 179665 314 1 2 52 42 174 0.875 3.38 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_1|63_aa MSTRERELVLEATHHEVIQQGKMHKHTKRPGASGDPTVATEHITHPHTESKSTSTYENRE EYE >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_1|192_bp atgtccacaagggaaagggaactggtattggaagcaacacatcatgaagtaatacaacaa ggcaaaatgcacaaacacaccaagagacctggtgccagtggagaccccactgtggccact gagcacattacacatccacatacagaatctaagagcacatccacatacgaaaaccgggaa gaatatgagtag >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_2|161_aa MNETRAFSSEKGAINIVYLLAIEGIHRLQEKQSTCHEENTSDSNTTLRGYREKLNTVLSG DSLSALSSASCPESPMLPRPSRAEGKRHPILHCLSVIALSGGFKKTLSSLSLQLMKNTGN ICRVPTAPSFQKHQVAFLLYSAAPMAVYHQDSTGSLFSYSQ >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_2|486_bp atgaatgaaactcgagctttctcatctgaaaaaggagcaatcaatatcgtctacctttta gccattgaagggattcacagattgcaggaaaaacaaagtacttgtcacgaagaaaacacc agtgactcaaataccactcttagaggctatagagaaaaacttaacacagttctttctgga gattccttatctgccctgagcagtgcctcatgtccagagtctccaatgctgccaaggccc agcagggcagaaggaaaaaggcatccgatccttcattgcttgtctgtcattgccctatca ggtggctttaagaagacattaagctctctgtcactgcagctgatgaaaaacacagggaac atctgcagggtacccacagccccatctttccagaaacaccaggtagcctttctcttgtat tcggctgcccctatggcagtataccaccaggacagcacaggatccctcttttcttacagc caataa >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_3|162_aa MDKRNPCKRQLNWMFPSNGKVTLNGPRVGDRVGGHYGGKKNKLSCLCVKSFTRRFQLSPV TWESCDVEHGQFILKNVHTVLVGSAMPESKVGPGNKCLWLLGNQHYHKTVVLIEPKVEVC VTRTCILTFRESAAMVNSSSEERKDSSNPEACSLVKNDVFDE >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_3|489_bp atggataaaagaaatccctgtaaaaggcagcttaactggatgtttccctccaatggcaaa gtgactctcaatgggccaagggtgggggatagggttggagggcattatggaggcaagaag aataagctttcttgcctgtgcgtgaagagtttcaccagacgatttcagctttctcctgtg acctgggaaagctgtgatgtggagcatggccagtttatacttaaaaatgtccacacagtt ctggttggttctgccatgccagaatcaaaggttggacccggaaacaaatgcctttggctg cttggtaaccagcactatcacaagacagttgtgttgattgagcctaaggtggaagtctgt gtaaccagaacttgcatcctcacctttagagagtcagcagcaatggtgaactcctcaagt gaggagagaaaggatagctcaaatccagaagcctgctctttggtgaaaaatgatgtcttt gatgagtga >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_4|100_aa MAMAGPWDPGTSKPAPAVTKTSATDGFDACYIQNLPSVVKTNTCRAGNSLVRGLGGCRAR NQEPWMPHPAQVRRPGPSDVAAHSSLLPAPSLASLLSAAP >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_4|303_bp atggccatggccgggccctgggaccctggaacaagcaagccagcacctgcagtcacaaaa acaagtgcaactgatggctttgacgcgtgttatatacagaacctgccaagtgtggttaaa acaaacacttgcagagcaggaaactccctggttagaggactaggaggatgccgtgccagg aaccaggagccctggatgcctcaccctgctcaagttcgcaggcctggtccctctgatgtg gccgctcattcctccttgctccccgctccatccctggcttcacttctttctgctgcacct tga >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_5|515_aa MAVINLVVVHSVFLLTVPFRLTYLIKKTWMFGLPFCKFVSAMLHIHMYLTFLFYVVILVT RYLIFFKCKDKVEFYRKLHAVAASAGMWTLVIVIVVPLVVSRYGIHEEYNEEHCFKFHKE LAYTYVKIINYMIVIFVIAVAVILLVFQVFIIMLMPTSHTHTPSTEMQTLQERVGPLDDG EVHKGALLMELAEKPLIGVLEKAVQGEKLHLRPCWASAQGTECLWKLLSEGQGWLSTRRP HALRALGRSLLRESRYQSQGEKPILPQVLLQRLVMMKISLASHPLGKGGSHLYSSRPRFS SAGAKEAGRLGPKRYSPQPNKPAVADYGQSASSGLTLTHSSSLVPLPPVPNPSAYPANFC SSPNVSCQPTAKREMSLGNWCPASQLPQPWLKAAMVQLRPWLHRVQAPSLGSFHMLLSLQ GSTDFQCKAPQLLTFPPASTQIISPCSTWCCGMMGKYGIDPSYSEQRCFEFCKDLNHREF IIMNYSVIVTTDDNGCDPLPDTDSCHHSTDKSSLT >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_5|1548_bp atggcggtcattaacttggtggtggtccacagcgtttttctgctgacagtgccatttcgc ttgacctacctcatcaagaagacttggatgtttgggctgcccttctgcaaatttgtgagt gccatgctgcacatccacatgtacctcacgttcctattctatgtggtgatcctggtcacc agatacctcatcttcttcaagtgcaaagacaaagtggaattctacagaaaactgcatgct gtggctgccagtgctggcatgtggacgctggtgattgtcattgtggtacccctggttgtc tcccggtatggaatccatgaggaatacaatgaggagcactgttttaaatttcacaaagag cttgcttacacatatgtgaaaatcatcaactatatgatagtcatttttgtcatagccgtt gctgtgattctgttggtcttccaggtcttcatcattatgttgatgcccacctcacataca cacacacccagcactgaaatgcagaccttgcaggaaagggtgggtcccctggatgatgga gaagttcacaaaggtgctttgctgatggaacttgctgaaaagccactcataggggtatta gaaaaagctgtccaaggggagaaactgcatctcaggccatgctgggcatctgcccaagga accgaatgcctttggaagctgctctctgaagggcagggatggctgtccacgaggaggcca catgcattgcgtgcattaggcaggagcctgctacgagaaagcaggtaccagagccaggga gaaaagcctattcttccccaagtgctgcttcagcgccttgtaatgatgaagattagttta gcatcccacccattggggaagggcggcagccatctctatagctccaggccacgcttttcc tctgctggagccaaggaggctggacggttgggtcccaagaggtattcgccacagcctaac aaaccggctgtggcagactacggccaaagtgcctcttcaggcctgaccttgacccattcc tcctcactggtcccattgcccccagttccaaatccaagtgcttatccagccaatttttgc tcttctccaaatgtaagctgtcagcccacagctaaaagagagatgagcctagggaattgg tgccctgcatcccagctgccccagccatggctaaaagcggccatggtacagctcaggccg tggcttcatagggtgcaagccccaagccttggcagtttccacatgctgttgagcctgcag ggaagcactgatttccaatgcaaagccccacaattactcacttttcctcctgcaagcaca caaattatctctccatgctccacatggtgctgtggaatgatgggaaagtatggcatagat ccaagttactcagaacaacgatgctttgaattctgcaaagatcttaaccacagagaattc attatcatgaactactctgtgattgtcactacggatgacaatggttgtgatcctcttcct gatacagacagctgtcatcattcaactgataaaagctctttgacctga >gi568815591f:37640394_37841308|GENSCAN_predicted_peptide_6|362_aa MWLDNDAASPVMVLNQNEMAKMTDIELRIWIALKIIEIQEKIETQSKESKKSSKVIQEVK GKIENKIPRNPTYKGHEGPLQGELQTTAQGNKRGYKQMEEHSMLMGRKNQYRENGHTAQG LNGKGSECLSKVERGNNVERAALKDALILGSETQPSQGNTTEHQPVEYLSQSYSSPSLKS FVVGAEFTRSQKAREPVDVVHIDQLLWVEQVSQGTFPDHATEDTLMITTGIFVAQSTLLQ ICLLLTLYPAAATPSSSQLPKVGWPLSRQGHRTGSLCGGTQRESFQHHFLQEAYSDPWYS KQLLSMPTPGQGGFRALAAERGRVDVYLIFCCAAAQPPLRKCTLPPYGRLEKLHLHVINY IK >gi568815591f:37640394_37841308|GENSCAN_predicted_CDS_6|1089_bp atgtggctggacaatgatgccgcttcaccagtgatggttctcaaccagaatgaaatggct aaaatgacagacatagaattaagaatctggatagcattgaagatcattgagattcaggag aaaattgaaactcaatccaaggaatctaagaaatccagtaaagtgattcaagaggtgaaa ggcaaaatagagaataaaatacctaggaatccaacttacaagggacatgaaggacctctt caaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaa cattccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggc ctgaatggcaaagggagtgagtgtttatcaaaggtggaaagagggaataatgtggagaga gctgctttgaaagacgctctgatcttaggttcagaaacgcaaccatctcaaggcaacacc acagagcatcaaccggtggaatacttatcacaatcttattcttctccatccctcaaatct tttgttgttggggctgaattcaccagaagccagaaggcaagagagcctgttgatgtagtt cacatagatcagcttctctgggtggagcaggtctctcaagggactttcccagaccacgct actgaagacactcttatgatcaccactggcatctttgttgctcagtccacgctcctccag atctgcctcctgctgaccttgtacccagctgcagcgacaccgagctcctcacagcttccc aaagtgggctggcccctctcacgacagggtcatcgcacaggctccctctgtgggggaaca caacgggagtcatttcaacaccactttcttcaggaagcctattctgatccttggtactcc aagcagctgttgtcaatgcctaccccagggcaggggggcttcagagccttggcagcagaa cgtggtcgcgtagatgtttatctgattttctgttgcgctgctgctcaaccgccactacgg aagtgcacactgccaccttatggacgtcttgagaagttacacttacacgttattaattat attaaataa