GENSCAN 1.0    Date run: 11-Nov-116    Time: 17:00:43

Sequence gi568815594f:54158769_54395269 : 236501 bp : 43.08% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   7355   7269   87  2  0   82   97     8 0.158   1.27
 1.02 Intr -   8729   8608  122  0  2   72   56    58 0.262   1.21
 1.01 Init -  11590  11515   76  1  1   81   96    70 0.433   8.35
 1.00 Prom -  16842  16803   40                              -3.36

 2.00 Prom +  18836  18875   40                              -6.16
 2.01 Init +  19580  19634   55  1  1   69   65    53 0.608   2.62
 2.02 Intr +  28414  28518  105  2  0   61   39    91 0.082   1.69
 2.03 Intr +  44790  44840   51  1  0   49   95    64 0.249   2.08
 2.04 Intr +  58966  59076  111  2  0   98    3   101 0.453   2.95
 2.05 Intr +  67306  67417  112  2  1   69   70    70 0.623   2.74
 2.06 Intr +  71750  71838   89  2  2   66  110    16 0.589   1.21
 2.07 Intr +  72342  72443  102  1  0   96   35    80 0.550   3.65
 2.08 Intr +  72853  72984  132  2  0   58   42   100 0.458   3.02
 2.09 Intr +  74334  74987  654  1  0   63   48   209 0.569   6.25
 2.10 Term +  75325  75701  377  2  2  -12   54   260 0.741   7.50
 2.11 PlyA +  79440  79445    6                               1.05

 3.00 Prom +  82153  82192   40                              -4.36
 3.01 Sngl +  85843  86172  330  0  0   88   44   308 0.989  22.42
 3.02 PlyA +  86256  86261    6                               1.05

 4.00 Prom +  87248  87287   40                              -9.06
 4.01 Init +  88140  88782  643  2  1   55   77   286 0.967  19.69
 4.02 Intr +  89432  89720  289  0  1   75    8   177 0.025   4.70
 4.03 Intr +  92937  92998   62  0  2   66   68    36 0.033  -2.22
 4.04 Intr + 102327 102644  318  1  0   86  109   207 0.941  18.53
 4.05 Intr + 104899 105159  261  2  0   79   75   244 0.979  19.66
 4.06 Intr + 106151 106281  131  0  2   75   66   125 0.982   9.41
 4.07 Intr + 108521 108692  172  1  1   65   43   196 0.748  12.32
 4.08 Intr + 108784 108973  190  2  1   75   98   134 0.999  11.74
 4.09 Intr + 111865 111980  116  1  2   47   95    81 0.987   4.79
 4.10 Intr + 113626 113684   59  2  2  100   89    27 0.008   2.60
 4.11 Intr + 114752 114962  211  1  1   43   81   208 0.007  14.09
 4.12 Intr + 115763 115857   95  0  2   69   76    76 0.988   4.18
 4.13 Intr + 116073 116205  133  2  1   81  111    73 0.999   9.12
 4.14 Intr + 118620 118724  105  1  0   92  111    37 0.990   6.59
 4.15 Intr + 119128 119238  111  2  0   86   80    61 0.978   5.45
 4.16 Intr + 119594 119747  154  0  1  102   90   131 0.996  13.83
 4.17 Intr + 121548 121714  167  0  2   94   66    88 0.986   6.80
 4.18 Intr + 126603 126718  116  1  2   86  101    70 0.809   8.27
 4.19 Intr + 127073 127195  123  1  0  105   46   128 0.999  11.08
 4.20 Intr + 128662 128773  112  0  1  137   52   106 0.999  11.85
 4.21 Intr + 130031 130130  100  0  1   74   39   127 0.996   5.57
 4.22 Intr + 130241 130346  106  2  1  100   93    37 0.910   5.62
 4.23 Intr + 131545 131786  242  0  2   53   70   363 0.995  27.35
 4.24 Term + 136357 136504  148  1  1  114   36   341 0.998  28.87
 4.25 PlyA + 138536 138541    6                               1.05

 5.03 PlyA - 139887 139882    6                               1.05
 5.02 Term - 143110 143094   17  2  2   95   54    23 0.055  -2.00
 5.01 Init - 150419 150347   73  2  1   77   80    70 0.773   6.33
 5.00 Prom - 158113 158074   40                              -3.76

 6.09 PlyA - 158600 158595    6                               1.05
 6.08 Term - 164114 164001  114  2  0   73   50   128 0.863   6.07
 6.07 Intr - 177244 177175   70  0  1  101   25    76 0.037   1.78
 6.06 Intr - 186836 186808   29  2  2   60   69    63 0.006  -1.49
 6.05 Intr - 194220 194141   80  1  2   94   78    39 0.019   2.77
 6.04 Intr - 201210 201163   48  1  0  105  103    23 0.096   4.15
 6.03 Intr - 232338 232257   82  0  1   67   92    14 0.121  -1.09
 6.02 Intr - 235778 235727   52  1  1   89   87    54 0.605   4.31
 6.01 Init - 236222 236179   44  2  2   80   99     1 0.293   0.29

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 100001 100049   49  1  1   67   96    47 0.811   2.60
S.002 Intr + 114769 114962  194  1  2   88   81   238 0.988  22.24

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_1|95_aa
MSKKHNGFGSQGFVRKWSAPVTCNPDMVFIPSGRYGNKAPSESREQPSPGTDPASTLILD
FSASRTTSILHQSSRLWEVDHVLQQFPDSWNTVFK

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_1|285_bp
atgtccaagaaacacaacggatttggctctcagggttttgttcggaaatggtcagctcct
gtgacttgcaatccagacatggtattcatcccctctggaagatacggcaacaaggcacca
tccgaaagcagagagcagccctcgccaggcactgaccctgccagcaccttaatcttggac
ttctcagcctctagaactacatccattcttcaccaatcatccaggctttgggaagtagac
catgtactgcagcaatttcctgactcctggaacaccgtcttcaag

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_2|595_aa
MRKTSIDTITAAATTIIQVEETEAQKGVTATVTVVQNPNVSDFYPTLCSAALSVLEAKKS
KIKALAGSLSALTYEPQCPEDDSKQRRTFKWNLRLVKEWPGTAPDFRGQTDLNSSRKVEL
AVELGGRGGEDTKSQLSGKRPQSQRQEERKGLQEGAELLVRKLGTTWNWGNLFAARAPVP
HFAPRIVLRPQDVRKECLEELATHSIFLDEEALPRLPDRRKADQARFDALVRPCFGACSF
GLTQVQLDLVHKIENNANAERRAWGAAASAALRFRARRAGAQGRRSPERLGIAANPGTTK
PPRTWWRWEGERTAQCSSGSRGQKPDSGGGWTVRAQSLPGEVCAQHSRLRCRGPGAWAAG
ELAGEPGKIRGAVAPSSGLRGRIPLSPAPGGRRRYLSGAQRWNPGGPERRLIWVTFPLLR
AASGLLAGSVWAREAPGGREGAEVSGFGLRMGRPCNSMWEKQGSVEEDGSPQQRRLPGSE
DVFPKPRCPGAPAPGDGDDDGDDGEMSPNSPGVPRRFSHFQQQFSYFCLNPAGFQLRSPR
KLKVQPKEGRDWKLTFGVVELRIPRSGVCVATRGRTSLDQHCHLLATRKNYCDPL

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_2|1788_bp
atgcgcaaaacctcaatagacactatcactgctgctgccaccactatcatccaagttgag
gaaactgaggcccagaaaggtgtgacagccacagtcacagtagtgcagaaccctaatgtg
tccgacttttatccaacactctgctccgcagcgctgtcggttctagaggctaagaagtcc
aagatcaaggcactggcaggatccctgtctgcactgacatacgagccccagtgtcctgag
gatgactccaaacagcgtcggaccttcaagtggaacctcaggcttgttaaggagtggcct
ggaactgcaccagattttagggggcaaacggacctgaattcttcaaggaaagtagaattg
gcggtggagctggggggcagaggaggagaagacacaaaaagccaactcagtgggaagcgc
cctcaaagccaaaggcaggaagagaggaaaggtctccaggaaggtgccgaacttcttgtg
aggaagttagggacgacttggaactggggaaacttgtttgcagccagagcgccggtcccg
cacttcgctccccgaattgtgctaagacctcaggatgtgcgcaaggagtgcctggaagaa
cttgccactcactccatcttcctggatgaggaggccctgcctcgactccccgacaggcgc
aaggccgaccaggctcgctttgacgccctggtcaggccctgcttcggagcttgcagtttt
gggttgacccaagtgcagctggacctcgttcacaagattgaaaacaatgcaaacgcagag
cgtcgtgcctggggcgcagcggcctctgcagctctgaggttcagagcgcggcgcgcaggg
gcgcaggggcgcaggtccccggagcggctgggcatcgccgccaaccccgggacaaccaag
ccgccgaggacttggtggaggtgggagggagagcggaccgcgcagtgttcctctgggtcc
cggggtcagaagccggacagtggcgggggctggacagtccgagcgcagagcctcccgggc
gaagtctgcgcgcagcactcccggctgcgctgtcggggaccaggagcctgggctgcgggc
gagctagctggcgagccgggtaagattcgcggggctgtcgcgccctctagtgggctccgg
ggacgcatcccgctttccccagccccgggcggtcggcgccgctatctcagtggagcgcag
cgctggaaccccgggggcccggagcggaggctaatctgggtcacctttcccctcctccgg
gcagccagtggactcctcgcaggctctgtgtgggcccgggaggcgccgggtggacgagag
ggggccgaggtctctgggtttggattgagaatgggacgtccttgtaattcaatgtgggag
aagcaggggagtgtggaggaggacgggtccccccagcagcgtcgtcttcccggctccgaa
gacgtcttcccgaagccccgctgccccggcgcaccagctcccggtgatggggatgatgat
ggggatgatggggagatgtccccgaattctcctggggtcccgaggagatttagccatttt
caacagcagttttcctatttctgcttaaaccccgcgggctttcaactgaggtcaccacga
aagctaaaagtacaacctaaggaaggaagagactggaaattaacatttggggtagtggag
ctcagaattcctcgcagcggcgtctgtgtcgcaacccgtggacgcacgtccttggaccaa
cactgccatctgctggccactcgtaagaactactgcgacccgctgtga

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_3|109_aa
MGKKQSRKTGNSEKQSASPPPEERSSSPAMEQSWTENDFDELREEGFRRSNYSELQEEIQ
TKGKEVENFEKTLDESITRITNTEKCLKEQMELKAKAQELHEECRSLRS

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_3|330_bp
atggggaaaaaacagagcagaaaaactggaaactctgaaaagcagagcgcctctcctcct
ccagaggaacgcagctcctcaccagcaatggaacaaagctggacggagaatgactttgac
gagttgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaa
accaaaggcaaagaagttgaaaactttgaaaaaactttagacgaatctataactagaata
accaatacagagaagtgcttaaaggagcagatggagctgaaagccaaggctcaagaacta
catgaagaatgcagaagccttaggagctga

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_4|1387_aa
MQTTIREYHKHLYANKLENLEEVDKFLDTYTLPRLNQEEVDSLNRPITGSEIVAIISSLP
TKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEGSIILIPKPGRDTTK
KENFRPISLVNIDAKILNKILANRIQQHIKKLIHHDQVGFMPGMQGWFNIPKSINVIQHV
NRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLELEKTTLKFMWNQKRARIAKSVLSQK
NKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNK
QWGKDSLFNKWPRSPKLSWDIKKKGMHPTYTGLSLILCQLSLPSILPNENEKVVQLNSSF
SLRCFGESEVSWQYPMSEEESSDVEIRNEENNSGLFVTVLEVSSASAAHTGLYTCYYNHT
QTEENELEGRHIYIYVPDPDVAFVPLGMTDYLVIVEDDDSAIIPCRTTDPETPVTLHNSE
GVVPASYDSRQGFNGTFTVGPYICEATVKGKKFQTIPFNVYALKATSELDLEMEALKTVY
KSGETIVVTCAVFNNEVVDLQWTYPGEVKGKGITMLEEIKVPSIKLVYTLTVPEATVKDS
GDYECAARQATREVKEMKKVTISVHEKGFIEIKPTFSQLEAVNLHEVKHFVVEVRAYPPP
RISWLKNNLTLIENLTEITTDVEKIQEIRYRSKLKLIRAKEEDSGHYTIVAQNEDAVKSY
TFELLTQVPSSILDLVDDHHGSTGGQTALFLSRCNNETSWTILANNVSNIITEIHSRDRS
TVEGRVTFAKVEETIAVRCLAKNLLGAENRELKLVAPTLRSELTVAAAVLVLLVIVIISL
IVLVVIWKQKPRYEIRWRVIESISPDGHEYIYVDPMQLPYDSRWEFPRDGLVLGRVLGSG
AFGKVVEGTAYGLSRSQPVMKVAVKMLKPTARSSEKQALMSELKIMTHLGPHLNIVNLLG
ACTKSGPIYIITEYCFYGDLVNYLHKNRDSFLSHHPEKPKKELDIFGLNPADESTRSYVI
LSFENNGDYMDMKQADTTQYVPMLERKEVSKYSDIQRSLYDRPASYKKKSMLDSEVKNLL
SDDNSEGLTLLDLLSFTYQVARGMEFLASKNCVHRDLAARNVLLAQGKIVKICDFGLARD
IMHDSNYVSKGSTFLPVKWMAPESIFDNLYTTLSDVWSYGILLWEIFSLGGTPYPGMMVD
STFYNKIKSGYRMAKPDHATSEVYEIMVKCWNSEPEKRPSFYHLSEIVENLLPGQYKKSY
EKIHLDFLKSDHPAVARMRVDSDNAYIGVTYKNEEDKLKDWEGGLDEQRLSADSGYIIPL
PDIDPVPEEEDLGKRNRHSSQTSEESAIETGSSSSTFIKREDETIEDIDMMDDIGIDSSD
LVEDSFL

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_4|4164_bp
atgcaaactaccatcagagaataccacaaacacctctatgcaaataaactagaaaatcta
gaagaagtggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagtt
gattctctgaatagaccaataacaggctctgaaattgtggcaataatcagtagcttacca
accaaaaagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggag
gaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccct
aactcattttatgagggcagcatcatcctgataccaaagccaggcagagacacaaccaaa
aaagagaattttagaccaatatccttggtgaacattgatgcaaaaatcctcaataaaata
ctggcaaacagaatccagcagcacatcaaaaagcttatccaccatgatcaagttggcttc
atgcctgggatgcaaggctggttcaatatacccaaatcaataaatgtaatccagcatgta
aacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgac
aaaattcaacaacccttcatgctaaaaactctcaataaattagaattggaaaaaactact
ttaaagttcatgtggaaccaaaaaagagcccgcatcgccaagtcagtcctaagccaaaag
aacaaagctggaggcatcacgctacctgacttcaaactatactacaaagctacagtaacc
aaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagaaccc
tcagaaataacgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaag
caatggggaaaggattctttatttaataaatggcccaggagccccaagttatcttgggac
atcaagaagaaaggaatgcacccaacttatacagggctgagcctaatcctctgccagctt
tcattaccctctatccttccaaatgaaaatgaaaaggttgtgcagctgaattcatccttt
tctctgagatgctttggggagagtgaagtgagctggcagtaccccatgtctgaagaagag
agctccgatgtggaaatcagaaatgaagaaaacaacagcggcctttttgtgacggtcttg
gaagtgagcagtgcctcggcggcccacacagggttgtacacttgctattacaaccacact
cagacagaagagaatgagcttgaaggcaggcacatttacatctatgtgccagacccagat
gtagcctttgtacctctaggaatgacggattatttagtcatcgtggaggatgatgattct
gccattataccttgtcgcacaactgatcccgagactcctgtaaccttacacaacagtgag
ggggtggtacctgcctcctacgacagcagacagggctttaatgggaccttcactgtaggg
ccctatatctgtgaggccaccgtcaaaggaaagaagttccagaccatcccatttaatgtt
tatgctttaaaagcaacatcagagctggatctagaaatggaagctcttaaaaccgtgtat
aagtcaggggaaacgattgtggtcacctgtgctgtttttaacaatgaggtggttgacctt
caatggacttaccctggagaagtgaaaggcaaaggcatcacaatgctggaagaaatcaaa
gtcccatccatcaaattggtgtacactttgacggtccccgaggccacggtgaaagacagt
ggagattacgaatgtgctgcccgccaggctaccagggaggtcaaagaaatgaagaaagtc
actatttctgtccatgagaaaggtttcattgaaatcaaacccaccttcagccagttggaa
gctgtcaacctgcatgaagtcaaacattttgttgtagaggtgcgggcctacccacctccc
aggatatcctggctgaaaaacaatctgactctgattgaaaatctcactgagatcaccact
gatgtggaaaagattcaggaaataaggtatcgaagcaaattaaagctgatccgtgctaag
gaagaagacagtggccattatactattgtagctcaaaatgaagatgctgtgaagagctat
acttttgaactgttaactcaagttccttcatccattctggacttggtcgatgatcaccat
ggctcaactgggggacagacggccctttttctctctagatgtaataatgaaacttcctgg
actattttggccaacaatgtctcaaacatcatcacggagatccactcccgagacaggagt
accgtggagggccgtgtgactttcgccaaagtggaggagaccatcgccgtgcgatgcctg
gctaagaatctccttggagctgagaaccgagagctgaagctggtggctcccaccctgcgt
tctgaactcacggtggctgctgcagtcctggtgctgttggtgattgtgatcatctcactt
attgtcctggttgtcatttggaaacagaaaccgaggtatgaaattcgctggagggtcatt
gaatcaatcagcccagatggacatgaatatatttatgtggacccgatgcagctgccttat
gactcaagatgggagtttccaagagatggactagtgcttggtcgggtcttggggtctgga
gcgtttgggaaggtggttgaaggaacagcctatggattaagccggtcccaacctgtcatg
aaagttgcagtgaagatgctaaaacccacggccagatccagtgaaaaacaagctctcatg
tctgaactgaagataatgactcacctggggccacatttgaacattgtaaacttgctggga
gcctgcaccaagtcaggccccatttacatcatcacagagtattgcttctatggagatttg
gtcaactatttgcataagaatagggatagcttcctgagccaccacccagagaagccaaag
aaagagctggatatctttggattgaaccctgctgatgaaagcacacggagctatgttatt
ttatcttttgaaaacaatggtgactacatggacatgaagcaggctgatactacacagtat
gtccccatgctagaaaggaaagaggtttctaaatattccgacatccagagatcactctat
gatcgtccagcctcatataagaagaaatctatgttagactcagaagtcaaaaacctcctt
tcagatgataactcagaaggccttactttattggatttgttgagcttcacctatcaagtt
gcccgaggaatggagtttttggcttcaaaaaattgtgtccaccgtgatctggctgctcgc
aacgtcctcctggcacaaggaaaaattgtgaagatctgtgactttggcctggccagagac
atcatgcatgattcgaactatgtgtcgaaaggcagtacctttctgcccgtgaagtggatg
gctcctgagagcatctttgacaacctctacaccacactgagtgatgtctggtcttatggc
attctgctctgggagatcttttcccttggtggcaccccttaccccggcatgatggtggat
tctactttctacaataagatcaagagtgggtaccggatggccaagcctgaccacgctacc
agtgaagtctacgagatcatggtgaaatgctggaacagtgagccggagaagagaccctcc
ttttaccacctgagtgagattgtggagaatctgctgcctggacaatataaaaagagttat
gaaaaaattcacctggacttcctgaagagtgaccatcctgctgtggcacgcatgcgtgtg
gactcagacaatgcatacattggtgtcacctacaaaaacgaggaagacaagctgaaggac
tgggagggtggtctggatgagcagagactgagcgctgacagtggctacatcattcctctg
cctgacattgaccctgtccctgaggaggaggacctgggcaagaggaacagacacagctcg
cagacctctgaagagagtgccattgagacgggttccagcagttccaccttcatcaagaga
gaggacgagaccattgaagacatcgacatgatggatgacatcggcatagactcttcagac
ctggtggaagacagcttcctgtaa

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_5|29_aa
MDVKVAMKVSGIKAKTGTAEVKLPAPTYE

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_5|90_bp
atggatgtgaaggtggccatgaaagtatctggaattaaagcaaagacagggactgcagaa
gtaaagctaccagctcccacctatgagtga

>gi568815594f:54158769_54395269|GENSCAN_predicted_peptide_6|172_aa
MGGGEGNAEEAFSSRLRNEDFKKLSFDISQIITLRNPAGSPEITSAVFYHLLSLSLQPDM
IDLQLLSFHRAYGHSATWMKLEAMIFRETTQTQKDKYCMFSFNFQYYGEEEWKLVQHVSN
QVLLYKVTTIAAQANAQRFHVKNPASSSTLYSFVDKAGGGQIGPQEIWVFED

>gi568815594f:54158769_54395269|GENSCAN_predicted_CDS_6|519_bp
atgggtggtggggaggggaacgctgaggaggccttctcctccagattacggaatgaagac
tttaagaagttgagctttgatatttcacagatcattactctaaggaatcctgctggaagc
ccagaaatcacatctgctgtgttctaccatctgctgtccttatctctccagccagatatg
atagacctccagttactaagttttcacagagcctacggccactcagcaacatggatgaaa
ctggaggccatgatcttcagagaaacaactcagacacagaaagacaaatactgtatgttc
tcatttaacttccagtactatggtgaagaggagtggaaacttgtccagcatgtcagtaac
caggtcttgctctacaaggtgacaaccatcgctgcccaggccaatgctcagagatttcac
gtgaagaatccggcttcttcatccactctctactccttcgtggacaaggcaggtggagga
cagattggcccacaagaaatctgggtgtttgaagattag