GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:46:21 Sequence gi568815586r:94922232_95167384 : 245153 bp : 41.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 50 45 6 -3.84 1.02 Term - 440 119 322 1 1 80 38 207 0.216 8.21 1.01 Init - 1465 1422 44 1 2 85 55 83 0.418 4.64 1.00 Prom - 2225 2186 40 -5.05 2.07 PlyA - 4192 4187 6 1.05 2.06 Term - 4693 4529 165 1 0 31 44 387 0.928 25.53 2.05 Intr - 5989 5879 111 1 0 52 37 102 0.093 1.16 2.04 Intr - 26631 26338 294 0 0 2 -1 291 0.145 7.88 2.03 Intr - 26748 26662 87 0 0 50 80 95 0.632 4.15 2.02 Intr - 28868 28748 121 1 1 55 83 82 0.683 3.98 2.01 Init - 30846 30788 59 0 2 97 72 40 0.772 4.13 2.00 Prom - 40619 40580 40 -5.15 3.07 PlyA - 42306 42301 6 1.05 3.06 Term - 49389 49209 181 2 1 96 43 139 0.217 6.30 3.05 Intr - 58212 58185 28 1 1 63 94 27 0.029 -2.94 3.04 Intr - 67039 66832 208 1 1 50 56 151 0.036 5.83 3.03 Intr - 72026 71953 74 0 2 76 6 56 0.008 -5.59 3.02 Intr - 80590 80508 83 0 2 81 97 63 0.958 4.86 3.01 Init - 81449 81364 86 2 2 104 96 103 0.998 13.04 3.00 Prom - 82830 82791 40 -5.65 4.02 PlyA - 83019 83014 6 1.05 4.01 Sngl - 91193 90858 336 2 0 49 37 202 0.856 6.98 4.00 Prom - 98907 98868 40 -7.75 5.13 PlyA - 98972 98967 6 1.05 5.12 Term - 100172 99998 175 1 1 133 48 106 0.994 7.35 5.11 Intr - 103024 102919 106 2 1 66 70 87 0.856 3.05 5.10 Intr - 109257 109118 140 2 2 68 93 49 0.463 2.59 5.09 Intr - 118366 118245 122 1 2 59 96 51 0.379 1.37 5.08 Intr - 127002 126837 166 2 1 61 30 130 0.848 3.54 5.07 Intr - 129712 129531 182 1 2 45 116 160 0.936 12.24 5.06 Intr - 135412 135322 91 0 1 75 93 95 0.990 7.78 5.05 Intr - 135647 135500 148 0 1 23 98 173 0.925 10.17 5.04 Intr - 136258 136079 180 2 0 95 40 81 0.825 2.92 5.03 Intr - 137753 137675 79 2 1 104 92 40 0.997 4.21 5.02 Intr - 140507 140277 231 2 0 32 99 127 0.947 5.25 5.01 Init - 145153 145100 54 1 0 63 87 60 0.664 4.73 5.00 Prom - 145975 145936 40 -6.65 6.00 Prom + 147616 147655 40 -5.65 6.01 Init + 148056 148063 8 2 2 103 80 0 0.740 1.25 6.02 Intr + 150395 150606 212 2 2 56 41 167 0.436 6.43 6.03 Intr + 151036 151319 284 2 2 61 24 254 0.399 12.51 6.04 Intr + 151370 151529 160 1 1 49 91 206 0.823 15.64 6.05 Intr + 162599 162738 140 0 2 103 121 -42 0.003 -0.14 6.06 Intr + 170419 170501 83 0 2 29 94 44 0.011 -3.38 6.07 Intr + 174296 174467 172 2 1 60 110 101 0.163 8.52 6.08 Intr + 176987 177054 68 1 2 62 77 40 0.051 -2.92 6.09 Term + 178197 178686 490 0 1 34 48 387 0.175 22.44 6.10 PlyA + 179263 179268 6 1.05 7.09 PlyA - 181029 181024 6 1.05 7.08 Term - 182505 182484 22 2 1 90 37 -4 0.245 -8.49 7.07 Intr - 182855 182776 80 2 2 95 99 70 0.726 6.23 7.06 Intr - 184806 184723 84 0 0 72 93 78 0.982 5.80 7.05 Intr - 186188 186117 72 2 0 104 102 42 0.982 5.88 7.04 Intr - 186330 186272 59 1 2 102 83 68 0.965 5.38 7.03 Intr - 191470 191420 51 2 0 122 94 49 0.947 6.86 7.02 Intr - 194695 194607 89 0 2 85 93 -18 0.441 -2.90 7.01 Init - 194880 194819 62 0 2 57 58 72 0.519 1.87 7.00 Prom - 196174 196135 40 -7.75 8.00 Prom + 196490 196529 40 -7.65 8.01 Sngl + 203529 204194 666 2 0 86 42 579 0.650 49.02 8.02 PlyA + 204540 204545 6 1.05 9.05 PlyA - 206016 206011 6 1.05 9.04 Term - 212595 212437 159 0 0 96 47 97 0.600 3.36 9.03 Intr - 215447 215291 157 1 1 101 53 154 0.967 12.29 9.02 Intr - 219308 219157 152 2 2 52 58 247 0.961 16.14 9.01 Intr - 230610 230580 31 2 1 111 87 21 0.070 1.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 39253 39389 137 2 2 76 48 71 0.848 -0.90 S.002 Term - 59675 59562 114 2 0 42 42 140 0.856 2.39 S.003 Term + 174614 174802 189 1 0 102 35 84 0.841 0.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_1|121_aa MSRVLVAHGPHPVASLSLSNPSSASEYPAERAQLLSAEVLLRGPGRQDSSLVLCYPHMVL GSLLHPLPQETPGVPADPNIEALLVLPSPKHVASFVTEENILNFLGGVGLQFVVHSKFLA L >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_1|366_bp atgagcagagtcctcgtggcccatgggccacatcccgttgcaagcttgtctctttcaaat ccttcctcagcctctgagtatccagctgaacgtgctcagctgctttctgctgaggtcctt ctcagaggacctggtagacaggactcaagccttgtcttgtgttacccccatatggtcttg ggaagtctccttcatcctttgcctcaggaaactccaggagtgcccgctgatcccaatata gaagccctgcttgttcttccatctccaaagcacgtagccagctttgtaacggaggaaaac attctgaattttcttggtggagttggacttcagttcgtggtgcactctaagtttcttgct ctttag >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_2|278_aa MHSNTLQEGEHTDGQMQELGHSPQAPLTSPLHTQTTHIINDDAREDPGFVETKVYTICGE EGKCDAKLTKCEAMRKILPEPSKDLESAQFHSRSAPGDAAVSRGPLCEAAVIRGPLCEAA IVWGSGKLKEHQSELDLKSKLLSLFRSKVLPLACVLRATPLFSLALLCMAGSQQPPEGER GDKGEARELKIIFNASQQKTEVLSPTSHKEMNSANNRVNETAVLKKKKKEEEEEEEEEEE EEEEEEEEEEEEEGRRRRRSSPSCYSITPELSCKLGHR >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_2|837_bp atgcactcaaacaccttacaggagggggagcacacagatgggcagatgcaggagctggga catagccctcaagcacctctcacttcacctctccacacccaaacaactcacataataaat gatgatgccagggaagatccaggttttgtggaaactaaggtttatacaatttgtggggag gaggggaagtgtgatgccaagttgactaaatgtgaggccatgaggaaaatattaccagag ccctccaaggaccttgagtctgcccaatttcacagtaggtctgccccgggtgatgcagcc gtcagtcgggggcccctgtgtgaagcggccgtcatccggggacccctgtgtgaagcagcc attgtctggggatctggcaaattgaaggagcaccagtctgagttggatctgaaatctaaa ttgttgtccttattcagaagcaaagtgctgcccctggcctgtgttctcagagccactcct ctcttttcccttgctctgctctgtatggcaggaagccaacagccacctgaaggagaacgt ggggacaagggagaagccagggaactgaagataatcttcaatgccagccagcaaaaaacg gaggtcctcagtccaacatcccacaaggaaatgaattctgccaacaaccgtgtgaatgag actgcagtcttgaagaagaagaagaaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaaggaagaagaagaagaagaagc agcccatcttgttattctataacccctgaactaagttgtaaacttggccatcgttga >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_3|219_aa MELVQVLKRGLQQITGHGGLRGYLRVFFRTNDAKVGTLVGEDKYGNKYYEDNKQFFGRHR WVVYTTEMNGKNTFWDVDGSMGKAEKHPLYNCGNEKCKEDNMKLGLMGIKSNGRVRWLFY TAVRESFFEKVTFEERRRSGEKAFQAEGTTEAFVEKQGGRHRWLHSMTDDPPTTKPLTAR KFIWTNHKFNVTGTPEQYVPYSTTRKKIQEWIPPSTPYK >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_3|660_bp atggagttagtgcaggtcctgaaacgcgggctgcagcagatcaccggccacggcggtctc cgaggctatctacgggtttttttcaggacaaatgatgcgaaggttggtacattagtgggg gaagacaaatatggaaacaaatactatgaagacaacaagcaattttttggccgtcaccga tgggttgtatatactactgaaatgaatggcaaaaacacattctgggatgtggatggaagc atggggaaggcagaaaaacatcctttatataattgtggtaatgaaaaatgcaaggaagat aatatgaagctgggcctcatgggaatcaagagcaatggaagggtcaggtggctgttttat acagcagttagagaaagcttctttgagaaagtgacatttgaggaaagacgtagaagcggg gagaaagcgttccaggcagagggaaccacagaagcctttgtagagaaacagggtggcagg catcgttggcttcacagtatgactgatgatcctccaacaacaaaaccacttactgctcgt aaattcatttggacgaaccataaattcaacgtgactggcaccccagaacaatatgtacct tattctaccactagaaagaagattcaggagtggatcccaccttcaacaccttacaagtaa >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_4|111_aa MLKKSIRTDGLDLDKEQISELVHKCKEMVQNVKHKDENGKYAEKTKYSGNVKKIIGDQSR RVNVHLIPRFRKEVTENVGEKLFKTISTAKNVTSPNSKMMLNSNKHQLQKP >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_4|336_bp atgttgaaaaaatctatcaggactgatggactggacctagacaaggaacaaattagtgaa ctggtacataaatgcaaagaaatggttcagaatgtaaagcacaaagatgaaaatggaaaa tatgcagaaaaaacaaaatattcaggtaacgttaaaaagataataggggatcaatccaga agggtcaatgttcacctaatacccagatttcggaaagaagtaacagaaaatgtaggggaa aaattatttaaaacaatttctactgctaaaaatgtcacatctccaaattcaaagatgatg ctgaattccaacaaacatcaactacaaaagccctag >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_5|557_aa MATIEEIAHQIIEQQMGEIVTEQQTGQKIQIVTALDHNTQGKQFILTNHDGSTPSKVILA RQDSTPGKVFLTTPDAAGVNQLFFTTPDLSAQHLQLLTDNSPDQGPNKVFDLCVVCGDKA SGRHYGAVTCEGCKGFFKRSIRKNLVYSCRGSKDCIINKHHRNRCQYCRLQRCIAFGMKQ DSVQCERKPIEVSREKSSNCAASTEKIYIRKDLRSPLTATPTFVTDSESTRSTGLLDSGM FMNIHPSGVKTESAVLMTSDKAESCQGDLSTLANVVTSLANLGKTKDLSQNSNEMSMIES LSNDDTSLCEFQEMQTNGDVSRAFDTLAKALNPGESTACQSSVAGMEGSVHLITGDSSIN YTEKEGPLLSDSHVAFRLTMPSPMPEYLNVHYIGESASRLLFLSMHWALSIPSFQALGQE NSISLVKAYWNELFTLGLAQCWQVMNVATILATFVNCLHNSLQQDHPSLENMEQIEKFQE KAYVEFQDYITKTYPDDTYRLSRLLLRLPALRLMNATITEELFFKGLIGNIRIDSVIPHI LKMEPADYNSQIIGHSI >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_5|1674_bp atggcaaccatagaagaaattgcacatcaaattattgaacaacagatgggagagattgtt acagagcagcaaactgggcagaaaatccagattgtgacagcacttgatcataatacccaa ggcaagcagttcattctgacaaatcacgacggctctactccaagcaaagtcattctggcc aggcaagattccactccgggaaaagttttccttacaactccagatgcagcaggtgtcaac cagttattttttaccactcctgatctgtctgcacaacacctgcagctcctaacagataat tctccagaccaaggaccaaataaggtttttgatctttgcgtagtatgtggagacaaagca tcaggacgtcattatggagcagtaacttgtgaaggctgcaaaggattttttaaaagaagc atccgaaaaaatttagtatattcatgtcgaggatcaaaggattgtattattaataagcac caccgaaaccgctgtcaatactgcaggttacagagatgtattgcgtttggaatgaagcaa gactctgtccaatgtgaaagaaaacccattgaagtatcacgagaaaaatcttccaactgt gccgcttcaacagaaaaaatctatatccgaaaggaccttcgtagcccattaactgcaact ccaacttttgtaacagatagtgaaagtacaaggtcaacaggactgttagattcaggaatg ttcatgaatattcatccatctggagtaaaaactgagtcagctgtgctgatgacatcagat aaggctgaatcatgtcagggagatttaagtacattggccaatgtggttacatcattagcg aatcttggaaaaactaaagatctttctcaaaatagtaatgaaatgtctatgattgaaagc ttaagcaatgatgatacctctttgtgtgaatttcaagaaatgcagaccaacggtgatgtt tcaagggcatttgacactcttgcaaaagcattgaatcctggagagagcacagcctgccag agctcagtagcgggcatggaaggaagtgtacacctaatcactggagattcaagcataaat tacaccgaaaaagaggggccacttctcagcgattcacatgtagctttcaggctcaccatg ccttctcctatgcctgagtacctgaatgtgcactacattggggagtctgcctccagactg ctgttcttatcaatgcactgggcactttcgattccttctttccaggctctagggcaagaa aacagcatatcactggtgaaagcttactggaatgaactttttactcttggtcttgcccag tgctggcaagtgatgaatgtagcaactatattagcaacatttgtcaattgtcttcacaat agtcttcaacaagatcatccaagcctagaaaacatggaacagatagagaaatttcaggaa aaggcttatgtggaattccaagattatataaccaaaacatatccagatgacacctacagg ttatccagactactactcagattgccagctttaagactgatgaatgctaccatcactgaa gaattgtttttcaaaggtctcattggcaatatacgaattgacagtgttatcccacatatt ttgaaaatggagcctgcagattataactctcaaataattggtcacagcatttga >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_6|538_aa MPRLTYLLKIMGQPGNIRTAQKWYLPELEELFLNKRAIYRKLGHAPCSENPGAVPFLLKR SRPTGSDDPGQICRRRPGDALGPRPLAVGVKGTPWPPPPTRSLVSPPSVSYRRFCALLTP ASGADATVPRLPLVDWGALREERLKKADGMWDRDSRRRELSVFGYFWGRLRISNGEKRRE EVVTFPGWGGGIGGPSSGSVSPLPPFGLEAECPARATRSQPGTLYARHCGTQFLACSHLL NFTTSSLNQGRHVIILLSFYGCRKQLRKPGDSKDHNGDGCYHRQPFHRPGSGVVSSITAK TRCHLTVTTSANSAKCLANVTMSHPQSFPMEYTLDLTPLAPNYNIITHLHHLQHNAWPKV EAPYSLLNEGMNCRWRLLTKAQTGLPRPAGCLLPWLHCLAHHVTTAAIHKAPNSEANRTV GCICSHDFLGDSCGILFSHPWDSKPVCTMELGRAAKLAPEFTKSNVKLTALATDSAEDHL AWSKDINAYNSDERTEKCPFPNIDGKDQDLAVLLGMLDPPELEEKGMGVRAYGVYFCS >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_6|1617_bp atgcccaggctgacctacttactgaagattatgggacaacctggcaacattcgtactgca caaaaatggtatctgcccgagttggaagaattatttcttaacaaaagagctatctatcgg aagcttggacatgccccctgcagtgagaaccctggggccgttccatttcttctgaaaaga tccaggcccacaggcagcgacgaccctggccagatttgccgacgccggccgggggatgcg ctgggaccgcgtccgttggcggttggggtgaagggcaccccctggcccccgcccccgacg cggtcgctggtgtccccacccagcgtttcttaccggcgcttttgcgccctgctgactccg gcgtcgggcgccgacgcgacagtcccgcggctgccactcgtggattggggggcgctccgg gaagagaggttgaagaaagccgacgggatgtgggatcgagattcacggcggagagagctt tctgtgtttgggtatttctgggggcgcctgcgcattagcaacggggagaagcggcgagag gaggtcgtgacgttcccagggtggggtggagggatcggagggccgagcagcggctctgtg agtccactgcccccttttggcttggaggcagagtgccccgccagagctacgcgttcccag ccaggtaccctatatgccaggcactgtggcacacaatttctggcatgttcacatctactg aatttcacaacatcctcactaaaccaaggaaggcacgtgattatattactttcattttat ggatgcagaaaacagctcagaaagcctggagacagtaaagaccacaatggagatgggtgt tatcatcgccaacctttccacaggcccggcagtggtgtcgtctccagcattactgccaaa acaagatgccatttaacagttactacgtcagccaactctgctaaatgcttagcaaatgtc accatgtctcaccctcagagcttccccatggaatacacattggatcttacaccccttgcc cccaactacaacatcatcacccaccttcaccatttacagcacaatgcctggcccaaggta gaggctccatattccttactgaacgaagggatgaactgcaggtggagactactaacaaaa gcacaaacaggactgcccagaccagctggctgcttgctgccgtggctacactgtctcgcc caccatgtcactactgctgccatccacaaggctcccaactcggaggctaatcgcactgtt ggctgtatctgttcccacgactttctaggagactcctgtggcattcttttttcccaccct tgggactctaaaccagtgtgcactatggagcttggcagagctgcaaagctggcaccagaa ttcaccaagagcaatgtgaagttgactgcccttgcaacagacagtgctgaggaccatctt gcctggagcaaggatatcaatgcttacaatagtgatgagcgaacagaaaaatgccctttt cccaacattgatggtaaggatcaggaccttgccgtcttgttgggcatgctggatcctcct gagttggaagaaaagggaatgggtgtcagagcttatggtgtttatttttgctcctga >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_7|172_aa MTALLVVIRKASAEYQVLSDRVISHILMLQLRPQVILNCLRRSRPCSLCQDYLKNLIEDA GDYRDTQDALAVVIEVANHANDTMKQGDNFQKLMQIQYSLNGHHEIVQPGRFNDALLYTT PVQSGMYKLNNMLSLAGMKVRKPTQEAYQNELKIESVERSFILSARCGSSPL >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_7|519_bp atgactgctctcctcgttgttattaggaaagcttctgccgaataccaggtcctaagtgat agggtgatcagtcacattcttatgctgcagttaaggccccaggtgattcttaactgttta aggagatccaggccctgctctctttgccaagattatttgaagaatctcatagaagatgct ggagattacagagacactcaagatgcccttgctgttgttatagaggtagccaaccacgcc aatgacaccatgaagcaaggagacaactttcagaaacttatgcaaattcagtacagctta aatggacaccatgaaattgtgcagcctggtcggtttaatgatgccctgctgtatacaaca ccagtgcagtctgggatgtataaactgaacaacatgctctcactggctggaatgaaggtc agaaaacctacccaagaagcctatcagaatgaattaaagattgaaagtgtagaacgttcc ttcattctctcagccaggtgtggtagctcacccctgtaa >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_8|221_aa MVFRHFMEVGWATYVAFGPHAGKFIVIIDVIDQNRALVGGPCAQVRRQAMPFKYMQLTDF ILKFPYSAHQKYVPQAWQKVDINTKWAATQWARKIEARERKAKMTDSDHFKVMKAKKMRD RIIKNEVNKLQKEALLKASLKKAPVAKGAAAAAAKFPAKKMTAVGKKAPAQKVPAQKATG QKAAPPPKAQKGQKAPAQKAPAPKASGKKAYEATIKTIKVL >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_8|666_bp atggtgttcaggcacttcatggaggttggctgggcaacctacgtcgcctttggacctcat gctggaaaattcattgtgatcatagatgttattgatcagaacagggctttggttggtgga ccttgcgctcaagtgaggagacaggccatgcctttcaagtacatgcagctcactgatttc atcctcaagtttccatacagtgcccaccagaagtatgtcccacaagcctggcagaaggta gacatcaatacaaaatgggcagccacacaatgggccaggaagattgaagccagagaaagg aaagccaagatgacagattctgatcattttaaagttatgaaggcaaagaaaatgagggac agaataatcaagaatgaagttaacaagcttcaaaaggaagctctcctgaaagcttctctc aaaaaagcacctgttgctaagggtgctgctgcagctgctgctaaatttccagcaaaaaag atgaccgctgtgggcaagaaggctccagcccagaaggttcctgcccagaaagccacaggc cagaaggcagcacctcctccaaaagctcagaagggtcaaaaagctccagcccagaaagcc cctgctccaaaggcatctggcaagaaagcatatgaggcaactataaaaacaataaaggtt ctttaa >gi568815586r:94922232_95167384|GENSCAN_predicted_peptide_9|166_aa XFVDVLKLLHIDFRDAVAHASRQLGKPVIEDRILNQILYYLPQLYELNRDLLKELEERML HWTEQQRIADIFVKKGPYLKMYSTYIKEFDKNIALLDEQCKKNPGFAAVVREFEMSPRCA NLALKHYLLKPVQRIPQYRLLLTGGLLQHFANHPVGLSISQLFLPL >gi568815586r:94922232_95167384|GENSCAN_predicted_CDS_9|501_bp nngtttgtggatgtgttaaaacttttgcatattgatttccgggatgcagtagctcatgct tccaggcaacttgggaaaccagtgattgaggaccggattctaaatcagatcctatactac ttgcctcagctgtatgagctcaaccgggatctcttgaaggaactggaggaaagaatgttg cactggactgaacaacaaagaattgctgatatctttgtaaagaagggaccatatctaaaa atgtattccacatacatcaaagaatttgataagaatatagccttgctggatgaacagtgc aagaaaaatccaggttttgctgctgttgttagagaatttgagatgagccctcgctgtgct aatctggccctcaagcactacctgctcaagccggttcagaggatcccccagtacaggctg ttgctgacaggtgggcttctccagcattttgccaaccacccagttggtttatccatcagc caactcttcttgcctttatga