GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:56:18 Sequence gi568815587r:3727566_3928138 : 200573 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 3373 3368 6 1.05 1.15 Term - 4013 3822 192 2 0 77 44 136 0.936 5.52 1.14 Intr - 7759 7626 134 2 2 53 84 124 0.954 8.86 1.13 Intr - 17084 16944 141 0 0 103 97 72 0.883 9.92 1.12 Intr - 25843 25751 93 2 0 94 110 24 0.792 5.14 1.11 Intr - 33061 32974 88 1 1 106 20 63 0.471 0.84 1.10 Intr - 35474 35337 138 2 0 86 86 66 0.988 6.86 1.09 Intr - 41179 41016 164 2 2 84 91 145 0.999 14.09 1.08 Intr - 44363 44183 181 2 1 80 68 109 0.943 7.54 1.07 Intr - 46174 46067 108 1 0 57 78 58 0.782 2.18 1.06 Intr - 48456 48317 140 1 2 93 58 85 0.982 6.18 1.05 Intr - 51484 51308 177 2 0 116 92 45 0.921 7.59 1.04 Intr - 51692 51591 102 0 0 83 91 78 0.994 7.75 1.03 Intr - 69985 69835 151 1 1 29 115 44 0.013 0.94 1.02 Intr - 70118 69997 122 0 2 129 -21 69 0.010 0.31 1.01 Init - 74740 74650 91 1 1 85 62 53 0.569 3.05 1.00 Prom - 75182 75143 40 -8.96 2.00 Prom + 76212 76251 40 -4.86 2.01 Init + 83300 83319 20 1 2 77 82 16 0.487 -0.29 2.02 Intr + 83685 83859 175 0 1 98 37 249 0.751 20.74 2.03 Intr + 89824 89970 147 0 0 35 84 229 0.753 17.63 2.04 Intr + 96318 96570 253 2 1 17 20 427 0.691 25.81 2.05 Intr + 96705 96895 191 1 2 85 59 174 0.906 13.50 2.06 Intr + 97455 97563 109 2 1 22 78 204 0.927 12.66 2.07 Term + 97763 97893 131 0 2 59 46 148 0.986 6.04 2.08 PlyA + 98765 98770 6 1.05 3.03 PlyA - 98957 98952 6 1.05 3.02 Term - 100641 99998 644 1 2 107 46 830 0.892 75.03 3.01 Init - 107843 107729 115 2 1 67 84 71 0.527 4.97 3.00 Prom - 122873 122834 40 -5.26 4.03 PlyA - 124029 124024 6 1.05 4.02 Term - 127093 127059 35 2 2 100 54 51 0.581 0.65 4.01 Init - 128606 128192 415 2 1 95 60 171 0.841 11.74 4.00 Prom - 154854 154815 40 -2.06 5.03 PlyA - 155263 155258 6 1.05 5.02 Term - 165357 164801 557 1 2 -8 42 558 0.672 36.09 5.01 Init - 170310 170178 133 0 1 78 47 24 0.161 -2.40 5.00 Prom - 170434 170395 40 -2.46 6.02 PlyA - 170630 170625 6 -1.95 6.01 Sngl - 172212 170641 1572 0 0 42 39 494 0.937 35.28 6.00 Prom - 175433 175394 40 -3.56 7.03 PlyA - 175504 175499 6 1.05 7.02 Term - 195203 195140 64 1 1 107 47 46 0.616 -0.24 7.01 Intr - 200082 199953 130 1 1 63 108 50 0.373 4.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 70118 69991 128 0 2 129 48 76 0.888 6.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_1|673_aa MGYLPTKVPNNVSLEITKVTGNSSLEYSSSIRFQKTRCPMNNRVDPEWRPRPPIRGRRGR SVRNEPLTPGGSGPLRAAPEAAVGGRGGSGGGDGFVGAARCSVSGGWQQGTPDTSPSPPN RDTGFGTTSGGAFGTSAFGSSNNTGGLFGNSQTKPGGLFGTSSFSQPATSTSTGFGFGTS TGTANTLFGTASTGTSLFSSQNNAFAQNKPTGFGNFGTSTSSGGLFGTTNTTSNPFGSTS GSLFGPSSFTAAPTGTTIKFNPPTGTDTMVKAGVSTNISTKHQCITAMKEYESKSLEELR LEDYQANRKGPQNQVGAGTTTGLFGSSPATSSATGLFSSSTTNSGFAYGQNKTAFGTSTT GFGTNPGGLFGQQNQQTTSLFSKPFGQATTTQNTGFSFGNTSTIGQPSTNTMGLFGVTQA SQPGGLFGTATNTSTGTAFGTGTGLFGQTNTGFGAVGSTLFGNNKLTTFGSSTTSAPSFG TTSGGLFGFGTNTSGNSIFGSKPAPGTLGTGLGAGFGTALGAGQASLFGNNQPKIGGPLG TGAFGAPGFNTTTATLGFGAPQAPVALTDPNASAAQQAVLQQHINSLTYSPFGDSPLFRN PMSDPKKKEERLKPTNPAAQKALTTPTHYKLTPRPATRVRPKALQTTGTAKSHLFDGLDD DEPSLANGAFMPK >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_1|2022_bp atgggctacttgcctaccaaagtacccaacaacgtatctttagaaatcaccaaagtcaca ggaaattcttccttggagtattcctcatccattcggtttcaaaagacgcgttgcccaatg aataacagggtcgacccagagtggcggccgcgtccaccaattcgcggacggcgagggcgg agcgtccgcaacgagcctctgacgccgggcggctcgggccctctgcgcgctgcgcccgaa gcggcggtcggtggcaggggtggtagcggcggcggcgacggtttcgtgggggccgcgcgc tgctctgtgagcggcgggtggcagcaggggactcctgacacttccccttccccaccgaac cgcgatactggctttggcactactagtggaggggcatttggaacatctgcatttggttct agcaacaatactggaggcctctttggaaattcacagactaaaccaggaggattgtttgga accagttcatttagccagccagctacctccacaagcactggctttgggtttggtacgtca acaggaacagcaaataccttgtttggaactgcaagcacagggaccagtctcttctcatcc caaaacaatgcctttgcacaaaataaaccaactggctttggcaattttggaaccagtact agcagtggaggactctttggaaccacaaataccacctctaatccttttggcagcacatct ggctccctctttgggccaagtagttttacagctgctcctactgggactactattaaattt aaccctccaactggtacagatactatggtcaaagctggagttagcactaacataagtacc aagcaccagtgtattactgctatgaaagaatatgaaagcaagtcactagaggaacttcgt ttagaggattatcaggctaacaggaagggcccacagaaccaggtgggagcaggtaccaca actggcttgtttgggtcttctccagccacttccagcgcaacaggactcttcagctcctcc accactaattcaggctttgcatatggtcagaacaaaactgcctttggaactagtacaact ggatttggaacaaatccaggtggtctctttggccaacagaatcagcagactaccagcctc ttcagcaaaccatttggccaggctacaaccacccagaacactggcttttcctttggtaat accagcaccataggacagccaagcaccaacaccatgggattatttggagtaacccaagcc tcacagcctggaggtctttttgggacagctacaaacaccagcactgggacagcatttgga acaggaacaggtctctttgggcagaccaatactggatttggtgctgttggttcgaccctg tttggcaataacaagcttactacatttggaagcagcacaaccagtgcaccttcatttggt acaaccagtggcgggctctttggttttggcacaaataccagtgggaatagtatttttgga agtaaaccagcacctgggactcttggaactgggcttggtgcaggatttggaacagctctt ggtgctggacaggcatctttgtttgggaacaaccaacctaagattggagggcctcttggt acaggagcctttggggcccctggatttaatactacgacagccactttgggctttggagcc ccccaggccccagtagctttgacagatccaaatgcttctgctgcccagcaggctgttctc cagcagcacatcaatagtctaacatactcaccttttggagactctcctctcttccggaat ccgatgtcagaccctaagaagaaggaagagagattgaaaccaacaaatccagcagcccag aaggctcttactacacctactcattataaactgacaccccgccctgccactagagtccgg ccaaaggctttacaaacaacaggcacagccaagtcacatctctttgatgggctggatgac gatgaaccatccctagccaatggagcattcatgcccaagtga >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_2|341_aa MGHAIHWSDKMYQVPLPLDRDGTLVRLRFTMVALVTVCCPLVAFLFCILWSLLFHFKETT ATHCGPLDPDGTLFRLRFTAMVWWAITFPVFGFFFCIIWSLVFHFEYTVATDCGVPNYLP SVSSAIGGEVPQRYVWRFCIGLHSAPRFLVAFAYWNHYLSCTSPCSCYRPLCRLNFGLNV VENLALLVLTYVSSSEDFTIHENAFIVFIASSLGHMLLTCILWRLTKKHTVSQEVRSIPS GGSKAAQKKIKDICPQDSGNGEDRKSYSWKQRLFIINFISFFSALAVYFRHNMYCEAGVY TIFAILEYTVVLTNMAFHMTAWWDFGNKELLITSQPEEKRF >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_2|1026_bp atgggccatgccatccactggtctgacaagatgtaccaggtcccactaccactggatcgg gatgggaccctggtacggctccgcttcaccatggtggccctggtcacggtctgctgtcca cttgtcgccttcctcttctgcatcctctggtccctgctcttccacttcaaggagacaacg gccacacactgtgggcctttggaccccgatgggaccttgttccggcttcgcttcacagcc atggtctggtgggccatcacttttcctgtgttcggcttcttcttctgcatcatctggtcc ctggtgttccactttgagtacacggtggccactgactgtggggtgcccaattacctgccc tcggtgagctcagccatcggcggggaggtgccccagcgctacgtgtggcgtttctgcatc ggcctgcactcggcgcctcgcttcttggtggccttcgcctactggaaccactacctcagc tgcacctccccgtgttcctgctatcgcccgctctgccgcctcaacttcggcctcaatgtc gtggagaacctcgcgttgctagtgctcacttatgtctcctcctccgaggacttcaccatc cacgaaaatgctttcattgtgttcattgcctcatccctcgggcacatgctcctcacctgc attctctggcggttgaccaagaagcacacagtaagtcaggaggtacggtctatccctagc gggggctccaaggcagcccagaagaaaatcaaggacatctgtcctcaggattcgggtaat ggtgaggatcgcaagtcctacagctggaaacagcggctcttcatcatcaacttcatctcc ttcttctcggcgctggctgtctactttcggcacaacatgtattgtgaggctggagtgtac accatctttgccatcctggagtacactgttgtcttaaccaacatggcgttccacatgacg gcctggtgggacttcgggaacaaggagctgctcataacctctcagcctgaggaaaagcga ttctga >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_3|252_aa MKSTVNAGKGKAEEGVAVMSWRSWGTVLGLKDRKDLKGGHCSQRGPEERGGTASTTATAP TMQSIKCVVVGDGAVGKTCLLICYTTNAFPKEYIPTVFDNYSAQSAVDGRTVNLNLWDTA GQEEYDRLRTLSYPQTNVFVICFSIASPPSYENVRHKWHPEVCHHCPDVPILLVGTKKDL RAQPDTLRRLKEQGQAPITPQQGQALAKQIHAVRYLECSALQQDGVKEVFAEAVRAVLNP TPIKRGRSCILL >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_3|759_bp atgaaaagtacggtcaatgctgggaaaggtaaggcagaagagggggttgctgtgatgtcc tggagaagctggggaacagtactgggcctgaaggacaggaaggatctgaaggggggtcac tgcagccagaggggtccagaagagagaggaggcactgcctccactacagcaactgcaccc acgatgcagagcatcaagtgcgtggtggtgggtgatggggctgtgggcaagacgtgcctg ctcatctgctacacaactaacgctttccccaaagagtacatccccaccgtgttcgacaat tacagcgcgcagagcgcagttgacgggcgcacagtgaacctgaacctgtgggacactgcg ggccaggaggagtatgaccgcctccgtacactctcctaccctcagaccaacgttttcgtc atctgtttctccattgccagtccgccgtcctatgagaacgtgcggcacaagtggcatcca gaggtgtgccaccactgccctgatgtgcccatcctgctggtgggcaccaagaaggacctg agagcccagcctgacaccctacggcgcctcaaggagcagggccaggcgcccatcacaccg cagcagggccaggcactggccaagcagatccacgctgtgcgctacctcgaatgctcagcc ctgcaacaggatggtgtcaaggaagtgttcgccgaggctgtccgggctgtgctcaacccc acgccgatcaagcgtgggcggtcctgcatcctcttgtga >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_4|149_aa MPSRLRSCPATREPAPSNSQRSSAPSLGAGGGLAPQVQQWVRRGCGGLSSCRFRRLRAGW LRGCCQLRRLLHPCGRGRPESPRAQKWRRKRREEKRGEKGAPDLALPKSGEQQKLGRACD PQGPEGAGVSGILGLLGRATSLVEEVSAT >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_4|450_bp atgccctcgcggctccgcagctgtccagccacccgcgagcctgccccctccaacagccaa aggtcaagtgctccaagtttgggtgcgggaggagggctggctcctcaggtccaacagtgg gtccgccggggctgcggcggcctgagctcgtgccgcttccggagacttcgggcagggtgg ctgcggggctgctgtcagctccgcagattactacacccctgcggaagggggcggccggag tctccgcgggcacagaagtggaggaggaagagaagagaagagaagagaggcgagaagggg gcacctgacctcgccctgccgaagagcggcgagcagcagaagctggggagggcgtgcgac ccgcagggtcctgagggcgcgggcgtctccgggatcctgggcctcctaggccgagctacc tctctggtggaggaggtgtcagcgacttga >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_5|229_aa MEYYAAIKNDEFVSFVGTWMKLEIIILSKLSQEQKTKHHIFSLIDAAAAGSPVLSAVVNP TVFFDIAVNSEPLGHVSFKLFADKVPKTAENFRALSTGEKGFGYKGSCFHRIIPGFMCQG GDFTHHNGTSSKSIYGEKFDDENFILKHTGPGILSMANAGPNTNSSQFFICTAKTEWLDG KHVVFGKAKEGMNIVEAIEHFGSRNGKTSKKITTADCGQLLISLTCVLS >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_5|690_bp atggaatactatgcagccataaaaaatgatgagttcgtgtcctttgtagggacatggatg aaattggaaatcatcattctcagtaaactatcgcaagaacaaaaaaccaaacaccacata ttctcactcatagatgctgctgctgccgggagccccgtactatcagccgtggttaacccc accgtgttcttcgacattgctgtcaacagcgagcccttgggccatgtctccttcaagcta tttgcagacaaggttccaaagacagcagaaaactttcgtgctctgagcactggagagaaa ggatttggttataagggttcctgctttcacagaattattccagggtttatgtgtcagggt ggtgacttcacacaccataatggcactagcagcaagtccatctacggggagaaatttgat gatgagaacttcatcctaaagcatacaggtcctggcatcttgtccatggcaaatgctgga cccaacacaaacagttcccagtttttcatctgcactgccaagactgagtggttggatggc aagcatgtggtctttggcaaggcgaaagaaggcatgaatattgtggaggccatagagcac tttgggtccaggaatggcaagaccagcaagaagatcaccactgctgactgtggacaactc ttaataagtttgacttgtgttttatcttag >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_6|523_aa MIISIDAEKAFDKIQQRFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIRLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNFSKLSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRVKYLGIQLT RDVKDLFKENYKPLLSEIKEDTNKWKNIPCSWVGRINIVKMAMLPKVIYRFNAIPIKLPM TFFTELEKTTLKFMWNQKRAHIAKSVLSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNR DIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSTWIKDLNVRPKIIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLI KLKSFCTAKETTIRVNRQLTKWEKIFTTYSSDKGLISRIYNELKQIYKKKTNNPHQKVGK GHEQTLLKRRHLCSQKTHEKMLTITGHQRNANQNHNEIPSHTS >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_6|1572_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacgcttcatgcta aaaactctcaataaattaggtattgatgggatgtatctcaaaataataagagctatctat gacaaacccacagccaatatcatactgaatggacaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt ctggccagggcaattaggcaggagaaggaaataaagggtattcgattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaaccccattgtctca gcccaaaatctccttaagctgataagcaacttcagcaaactctcaggatacaaaatcaat gtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg agtgaactcccattcacaattgcttcaaagagagtaaaatacctaggaatccaacttaca agggatgtgaaggacctcttcaaggagaactacaaaccactgctcagtgaaataaaagag gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaa atggccatgctgcccaaggtaatttatagattcaatgccatccccatcaaactaccaatg actttcttcacagaattggaaaaaactactttaaagttcatgtggaaccaaaaaagagcc cacatcgccaagtcagtcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaaactatactacaaagctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataacgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaacatggattaaagacttaaacgttagacctaaaatcata aaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaactgaca aaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaacccccatcaaaaagtgggcaaa ggacatgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaa atgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatct cacaccagttag >gi568815587r:3727566_3928138|GENSCAN_predicted_peptide_7|64_aa XTQFATCTTITAPCFFASQLFRVKVHDEWLRPKEPRANCEKGYKKEADPVKEEESLKEVF LPRD >gi568815587r:3727566_3928138|GENSCAN_predicted_CDS_7|195_bp ngaactcaatttgctacctgcaccaccatcactgcaccctgcttctttgcatcacagctc ttcagagtcaaagtccatgatgagtggctaagaccaaaagagcctagggcaaactgcgaa aaaggttacaagaaagaagcagacccagttaaggaagaagaatccttaaaggaggttttc ctgcccagggactga