GENSCAN 1.0    Date run: 4-Nov-116    Time: 15:47:12

Sequence gi568815595f:32853423_33054502 : 201080 bp : 44.09% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  19948  20079  132  0  0   58   68    42 0.342   0.04
 1.02 Intr +  20396  20563  168  1  0  135   99   113 0.697  17.24
 1.03 Intr +  29568  29594   27  2  0  101   78    37 0.668   2.31
 1.04 Intr +  32512  32646  135  0  0   85   87   254 0.996  25.76
 1.05 Term +  36938  38389 1452  1  0  131   32  2414 0.995 230.84
 1.06 PlyA +  38821  38826    6                              -0.45

 2.00 Prom +  38848  38887   40                              -5.66
 2.01 Init +  49638  50055  418  2  1   97   76   286 0.126  24.89
 2.02 Term +  60808  60857   50  2  2   67   55    58 0.034  -2.13
 2.03 PlyA +  62346  62351    6                               1.05

 3.00 Prom +  79639  79678   40                              -1.76
 3.01 Init +  82400  82467   68  1  2   63   78    59 0.730   2.94
 3.02 Intr +  86604  86731  128  0  2   98   58   145 0.940  12.82
 3.03 Term +  87355  87545  191  2  2   65   43    61 0.381  -3.09
 3.04 PlyA +  88964  88969    6                              -0.45

 4.00 Prom +  89794  89833   40                              -2.26
 4.01 Sngl + 100001 101083 1083  1  0   52   43   872 0.999  76.18
 4.02 PlyA + 102853 102858    6                               1.05

 5.00 Prom + 111301 111340   40                              -5.16
 5.01 Init + 111829 111964  136  0  1   59   57   149 0.144   9.20
 5.02 Intr + 117226 117357  132  2  0   56  111    36 0.127   3.32
 5.03 Intr + 119634 119682   49  1  1   86   88    11 0.461  -1.36
 5.04 Intr + 122859 122916   58  0  1   84   46    80 0.830   2.29
 5.05 Term + 123251 123472  222  1  0  128   38    88 0.854   4.72
 5.06 PlyA + 125871 125876    6                               1.05

 6.14 PlyA - 126131 126126    6                               1.05
 6.13 Term - 143922 143623  300  0  0  116   45   183 0.007  12.02
 6.12 Intr - 147855 147625  231  0  0   21   86   107 0.006   1.77
 6.11 Intr - 160888 160634  255  1  0  137  102   166 0.987  20.54
 6.10 Intr - 163418 163287  132  2  0   73   97    94 0.991   9.64
 6.09 Intr - 165139 165026  114  1  0   61   78   102 0.992   7.14
 6.08 Intr - 168233 168144   90  2  0   95   96    23 0.121   3.99
 6.07 Intr - 169787 169695   93  2  0   66   30    94 0.013   1.56
 6.06 Intr - 170903 170829   75  2  0   60  116    52 0.016   4.91
 6.05 Intr - 172578 172418  161  1  2   68   58    62 0.004   0.91
 6.04 Intr - 181280 181103  178  2  1   73   56   168 0.000  11.59
 6.03 Intr - 192810 192698  113  1  2   98   98   148 0.949  16.90
 6.02 Intr - 194707 194572  136  1  1   21  113    65 0.250   2.44
 6.01 Intr - 200127 200069   59  1  2  100   91    41 0.394   4.20

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -   8230   8191   40  1  1   83   67    43 0.839   2.05
S.002 Sngl - 144063 143623  441  0  0   79   45   316 0.940  20.65
S.003 Init + 180441 180764  324  2  0   79    5   231 0.974  11.23
S.004 Term + 180774 180914  141  2  0   80   47   201 0.999  13.13

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_1|637_aa
ARAAKETFLVHSYIQATNIRLGNTVLTTFLDNEAKIQGKLSCQEVLHLYCDTCSVPICRE
CTMGRHGGHSFIYLQEALQDSRALTIQLLADAQQGRQAIQTKQKKLLLQLSIEQAQTVAE
QVEMKAKVVQSEVKAVTARHKKALEERECELLWKVEKIRQVKAKSLYLQVEKLRQNLNKL
ESTISAVQQVLEEGRALDILLARDRMLAQVQELKTVRSLLQPQEDDRVMFTPPDQALYLA
IKSFGFVSSGAFAPLTKATGDGLKRALQGKVASFTVIGYDHDGEPRLSGGDLMSAVVLGP
DGNLFGAEVSDQQNGTYVVSYRPQLEGEHLVSVTLCNQHIENSPFKVVVKSGRSYVGIGL
PGLSFGSEGDSDGKLCRPWGVSVDKEGYIIVADRSNNRIQVFKPCGAFHHKFGTLGSRPG
QFDRPAGVACDASRRIVVADKDNHRIQIFTFEGQFLLKFGEKGTKNGQFNYPWDVAVNSE
GKILVSDTRNHRIQLFGPDGVFLNKYGFEGALWKHFDSPRGVAFNHEGHLVVTDFNNHRL
LVIHPDCQSARFLGSEGTGNGQFLRPQGVAVDQEGRIIVADSRNHRVQMFESNGSFLCKF
GAQGSGFGQMDRPSGIAITPDGMIVVVDFGNNRILVF

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_1|1914_bp
gctagagcagcaaaggaaacttttctggtacattcttacatccaggccactaatatcaga
ctaggtaacacagtcttaacaacttttctggataatgaagctaagattcagggcaaactc
tcatgccaggaggtgctgcacctgtactgtgacacttgctctgtacccatctgtcgtgag
tgcacaatgggccggcatgggggccacagcttcatctacctccaggaggcactgcaggac
tcacgggcactcaccatccagctgctggcagatgcccagcagggacgacaggcaatccag
acaaagcagaagaagctgcttctgcagctgagcatcgagcaggcccagacggtggcggaa
caggtggagatgaaggcgaaggttgtgcagtcggaggtcaaagccgtgacggcgaggcat
aagaaagccctggaggaacgcgagtgtgagctgctgtggaaggtagaaaagatccgccag
gtgaaagccaagtctctgtacctgcaggtggagaagctgcggcaaaacctcaacaagctt
gagagcaccatcagtgccgtgcagcaggtcctggaggagggtagagcgctagacatccta
ctggcccgagaccggatgctggcccaggtgcaggagctgaagaccgtgcggagcctcctg
cagccccaggaagacgaccgagtcatgttcacaccccccgatcaggcactgtaccttgcc
atcaagtcttttggctttgttagcagcggggcctttgccccactcaccaaggccacaggc
gatggcctcaagcgtgccctccagggtaaggtggcctccttcacagtcattggttatgac
cacgatggtgagccccgcctctcaggaggcgacctgatgtcggctgtggtcctgggccct
gatggcaacctgtttggtgcagaggtgagtgatcagcagaatgggacatacgtggtgagt
taccgaccccagctggagggtgagcacctggtatctgtgacactgtgcaaccagcacatt
gagaacagccctttcaaggtggtggtcaagtcaggccgcagctacgtgggcattgggctc
ccgggcctgagcttcggcagtgagggtgacagcgatggcaagctctgccgcccttggggt
gtgagtgtagacaaggagggctacatcattgtcgccgaccgcagcaacaaccgcatccag
gtgttcaagccctgcggcgccttccaccacaaattcggcaccctgggctcccggcctggg
cagttcgaccgaccagccggcgtggcctgtgacgcctcacgcaggatcgtggtggctgac
aaggacaatcatcgcatccagatcttcacgttcgagggccagttcctcctcaagtttggt
gagaaaggaaccaagaatgggcagttcaactacccttgggatgtggcggtgaattctgag
ggcaagatcctggtctcagacacgaggaaccaccggatccagctgtttgggcctgatggt
gtcttcctaaacaagtatggcttcgagggggctctctggaagcactttgactccccacgg
ggtgtggccttcaaccatgagggccacttggtggtcactgacttcaacaaccaccggctc
ctggttattcaccccgactgccagtcggcacgctttctgggctcggagggcacaggcaat
gggcagttcctgcgcccacaaggggtagctgtggaccaggaagggcgcatcattgtggcg
gattccaggaaccatcgggtacagatgtttgaatccaacggcagcttcctgtgcaagttt
ggtgctcaaggcagcggctttgggcagatggaccgcccttccggcatcgccatcaccccc
gacggaatgatcgttgtggtggactttggcaacaatcgaatcctcgtcttctaa

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_2|155_aa
MPEPSPASMGSCAARASPTNAAPCSTAPSPIDRPRAEECERMARDWQAAPPAGSSTCSNP
LRDPLGEASWAPESGGDVESLYIQLRDCKHTNQHPVFSSRFVNAPIGTLYLAALVGTWRT
FMSSSGIVNTPISTLCLAQGHCSKSKAKTQAEVHF

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_2|468_bp
atgcctgagccttcccccgcctccatgggttcctgtgcagcccgagcttccccgacgaat
gccgccccctgctccacggcgcccagtcccatcgaccgcccaagagctgaggagtgcgag
cgcatggcgcgggactggcaggcagctccacctgcaggcagctccacctgcagcaacccc
ctgcgggatccactgggtgaagccagctgggctcctgagtctggtggggacgtggaaagt
ctttatatccagctcagggattgtaaacacaccaatcagcaccctgtgtttagctcaagg
tttgtgaatgcaccaatcggcactctgtatctagctgccctggtggggacgtggagaacc
tttatgtctagctcagggattgtaaatacaccaatcagcaccctgtgtttagctcaagga
cattgttccaaaagcaaggccaagacgcaggcagaagtccacttctga

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_3|128_aa
MNMTTCQEEFSDEISVTLLNPIRAPSPIDRPKGEECERMARDWQTAPPAAPVRDPLGEDS
WAPESVWKLCSFALYNKSCYCSLFGLTAFMSCNTHCDNLQLHSSAQRDHEPTERNEQLQT
HWFKSCNT

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_3|387_bp
atgaatatgaccacttgccaggaagaattttctgatgagatcagtgtgacattactgaat
cccatacgggcgcccagtcccatcgaccgcccaaagggtgaggagtgcgagcgcatggcg
cgggactggcagacagctccacctgcagccccagtgcgggatccactgggtgaagacagc
tgggctcctgagtctgtgtggaagctttgttcttttgctctttacaacaaatcctgctac
tgttcgctctttgggttaactgcttttatgagctgtaacactcactgtgacaatttgcag
cttcactcctcagcccagcgagaccacgagcccactgagaggaacgaacaactccagacg
cactggtttaagagctgtaacacttaa

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_4|360_aa
MNPTDIADTTLDESIYSNYYLYESIPKPCTKEGIKAFGELFLPPLYSLVFVFGLLGNSVV
VLVLFKYKRLRSMTDVYLLNLAISDLLFVFSLPFWGYYAADQWVFGLGLCKMISWMYLVG
FYSGIFFVMLMSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVFASLPGFLFSTCYT
ERNHTYCKTKYSLNSTTWKVLSSLEINILGLVIPLGIMLFCYSMIIRTLQHCKNEKKNKA
VKMIFAVVVLFLGFWTPYNIVLFLETLVELEVLQDCTFERYLDYAIQATETLAFVHCCLN
PIIYFFLGEKFRKYILQLFKTCRGLFVLCQYCGLLQIYSADTPSSSYTQSTMDHDLHDAL

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_4|1083_bp
atgaaccccacggatatagcagacaccaccctcgatgaaagcatatacagcaattactat
ctgtatgaaagtatccccaagccttgcaccaaagaaggcatcaaggcatttggggagctc
ttcctgcccccactgtattccttggtttttgtatttggtctgcttggaaattctgtggtg
gttctggtcctgttcaaatacaagcggctcaggtccatgactgatgtgtacctgctcaac
cttgccatctcggatctgctcttcgtgttttccctccctttttggggctactatgcagca
gaccagtgggtttttgggctaggtctgtgcaagatgatttcctggatgtacttggtgggc
ttttacagtggcatattctttgtcatgctcatgagcattgatagatacctggcaattgtg
cacgcggtgttttccttgagggcaaggaccttgacttatggggtcatcaccagtttggct
acatggtcagtggctgtgttcgcctcccttcctggctttctgttcagcacttgttatact
gagcgcaaccatacctactgcaaaaccaagtactctctcaactccacgacgtggaaggtt
ctcagctccctggaaatcaacattctcggattggtgatccccttagggatcatgctgttt
tgctactccatgatcatcaggaccttgcagcattgtaaaaatgagaagaagaacaaggcg
gtgaagatgatctttgccgtggtggtcctcttccttgggttctggacaccttacaacata
gtgctcttcctagagaccctggtggagctagaagtccttcaggactgcacctttgaaaga
tacttggactatgccatccaggccacagaaactctggcttttgttcactgctgccttaat
cccatcatctacttttttctgggggagaaatttcgcaagtacatcctacagctcttcaaa
acctgcaggggcctttttgtgctctgccaatactgtgggctcctccaaatttactctgct
gacacccccagctcatcttacacgcagtccaccatggatcatgatctccatgatgctctg
tag

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_5|198_aa
MWESLELPGDLLNGLDQNADSDMDNKAQIEVVSDGNKEVLGDWSKGVTRWIHHSRLKPVA
AVTPDDNQWISQQDPNRPTRIILRRNLTTVCFPLHRSPLTHEMLKSLRRLTGAIEGLFCF
LLLKLPVEASCFLGPLPQKLDVEKKGKDVSFQEYRWDRFNFKIIRNSEVGVDKEEKSEDK
RMIKKMDGGRISFTHWDI

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_5|597_bp
atgtgggaaagtttggaacttcctggagacttgttgaatggcttagatcaaaatgctgat
agtgatatggacaataaagcccagattgaggtggtctcagatggaaataaggaagttctt
ggggactggagcaaaggtgtcacacgttggattcaccatagccggctgaaaccagtggca
gcagtgactcccgatgacaaccagtggattagccagcaagacccaaatcgccccacccga
ataatcctacggcgaaacctaaccaccgtatgcttccccctgcacagatctcccctcacc
cacgaaatgcttaaaagcctcagaagacttactggagctatcgagggccttttctgcttc
ctgctgctcaagctgcccgtggaagcttcatgcttcttagggccccttcctcagaagctg
gatgtggagaagaaaggcaaggatgtgtcatttcaggaatatagatgggacaggtttaat
tttaaaattattagaaattcagaggtgggggtggacaaagaagagaagagtgaagacaaa
aggatgatcaagaaaatggatggtggtagaatatcttttacacactgggatatttaa

>gi568815595f:32853423_33054502|GENSCAN_predicted_peptide_6|645_aa
XSNITDAFLSQRKCEPKGPLDSRASEAQSLPLRCKQCSMERDITNKKHSTAWVTTTLEAD
EGHKEGANSPYAAQPTSYDYDAPLSEAGDLTEKYFALRNIIQKGHLIVSTRDGQYVGCDG
PARMLHDIIELVQQFGRPYISQGIITGPDEHTTIPGAAGNGAAGDSAQSHRGGLQGLTTS
FLLHPFHSSSKARAVTRGQGQQRQRLRAWEWVLPGPFEKVPEGPIPPSTPKFAYGKVTLE
KKIAPNVQNNNIAKTKLAYYQKHILFTVKENCLKTVGAALDILCPSGPIKSLYPLTFIQV
KQHYGFVLYRTTLPQDCSNPAPLSSPLNGVHDRAYVAVDGIPQGVLERNNVITLNITGKA
GATLDLLVENMGRVNYGAYINDFKGLVSNLTLSSNILTDWTIFPLDTEDAVCSHLGGWGH
RDSGHHDEAWAHNSSNYTLPAFYMGNFSIPSGIPDLPQDTFIQFPGWTKQLESNLGNVAR
CKSEREREKREDRKEEKRRRRRRRGEEKRREKGRTRRRRKRRKEQEVVESGVSKGECGRH
KEERRKGQVWINGFNLGRYWPARGPQLTLFVPQHILMTSAPNTITVLELEWAPCSSDDPE
LCAVTFVDRPVIGSSVTYDHPSKPVEKRLMPPPPQKNKDSWLDHV

>gi568815595f:32853423_33054502|GENSCAN_predicted_CDS_6|1938_bp
ngcagcaacatcacagatgctttcctaagccagaggaagtgtgagcccaaaggacccttg
gactcaagggcaagtgaggcacagtctctgcccctgaggtgcaagcagtgtagcatggag
agagacataacaaataagaaacacagcacagcatgggtgactaccacattagaagcagac
gagggccacaaagaaggggccaactcaccctatgcagcacagcccaccagctacgactat
gatgccccactgagtgaggctggggacctcactgagaagtattttgctctgcgaaacatc
atccagaagggtcaccttattgtctccaccagagatggccaatatgttggctgtgatgga
cctgcccgcatgctacatgacatcattgaacttgtgcagcaatttgggagaccatatatt
tcccaaggcatcatcacaggtccagatgaacacacgaccatcccaggagcagctggcaat
ggtgctgcgggtgactcggcccagtcccatcgtggcgggcttcagggactgaccacctcc
ttcctgctgcaccctttccacagcagcagcaaggcaagagcagtgacgcggggccagggt
caacagcggcagaggctccgggcctgggagtgggtcctgcctggcccctttgaaaaagta
ccagaaggtcctatccctccatctacaccaaagtttgcatatggaaaggtcactttggaa
aagaaaatagctccaaatgttcaaaacaacaacattgctaaaaccaagctagcttattac
caaaagcacattctgtttaccgtgaaggaaaattgtttaaagacagtgggagcagctctg
gacattctgtgtccctctgggcccatcaaaagcctttatcccttgacatttatccaggtg
aaacagcattatgggtttgtgctgtaccggacaacacttcctcaagattgcagcaaccca
gcacctctctcttcacccctcaatggagtccacgatcgagcatatgttgctgtggatggg
atcccccagggagtccttgagcgaaacaatgtgatcactctgaacataacagggaaagct
ggagccactctggaccttctggtagagaacatgggacgtgtgaactatggtgcatatatc
aacgattttaagggtttggtttctaacctgactctcagttccaatatcctcacggactgg
acgatctttccactggacactgaggatgcagtgtgcagccacctggggggctggggacac
cgtgacagtggccaccatgatgaagcctgggcccacaactcatccaactacacgctcccg
gccttttatatggggaacttctccattcccagtgggatcccagacttgccccaggacacc
tttatccagtttcctggatggaccaagcaactggagtccaaccttggcaatgtagcaaga
tgtaagagtgagagagagagagagaagagagaagacaggaaggaggagaagagaagaagg
agaaggagaagaggagaggagaagagaagagagaaaggaagaacaagaaggagaagaaag
agaaggaaagaacaagaagtagtggagagtggagtgagcaagggggagtgcggcagacac
aaggaggaaaggaggaagggccaggtctggattaatggctttaaccttggccgctattgg
ccagcccggggccctcagttgaccttgtttgtgccccagcacatcctgatgacctcggcc
ccaaacaccatcaccgtgctggaactggagtgggcaccctgcagcagtgatgatccagaa
ctatgtgctgtgacgttcgtggacaggccagttattggctcatctgtgacctacgatcat
ccctccaaacctgttgaaaaaagactcatgcccccacccccgcaaaaaaacaaagattca
tggctggaccatgtatga