GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:10:15 Sequence gi568815590r:37865264_38066469 : 201206 bp : 47.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 39 34 6 1.05 1.08 Term - 3450 3385 66 0 0 101 43 39 0.816 -1.36 1.07 Intr - 5328 5157 172 2 1 88 95 129 0.609 13.55 1.06 Intr - 7829 6015 1815 1 0 144 94 530 0.931 46.53 1.05 Intr - 10059 9252 808 1 1 77 117 531 0.933 45.41 1.04 Intr - 12288 11846 443 2 2 117 101 577 0.936 55.00 1.03 Intr - 17771 17609 163 0 1 69 90 94 0.197 6.73 1.02 Intr - 24447 24413 35 2 2 59 65 15 0.036 -5.83 1.01 Init - 34178 33808 371 2 2 92 73 717 0.143 65.07 1.00 Prom - 54706 54667 40 -6.06 2.11 PlyA - 54954 54949 6 1.05 2.10 Term - 69223 69030 194 2 2 26 34 194 0.997 5.38 2.09 Intr - 69952 69810 143 0 2 93 65 136 0.990 11.80 2.08 Intr - 70606 70441 166 2 1 62 91 104 0.925 7.12 2.07 Intr - 71607 71457 151 0 1 126 56 46 0.916 4.94 2.06 Intr - 72123 72014 110 1 2 60 65 105 0.846 5.40 2.05 Intr - 72486 72375 112 0 1 83 96 53 0.881 5.55 2.04 Intr - 73571 73437 135 2 0 57 76 110 0.132 7.46 2.03 Intr - 83525 83423 103 1 1 86 46 47 0.038 0.48 2.02 Intr - 99321 99190 132 2 0 80 62 75 0.730 3.86 2.01 Init - 101206 100002 1205 1 2 75 77 1603 0.921 148.44 2.00 Prom - 108378 108339 40 -3.76 3.00 Prom + 110459 110498 40 -2.86 3.01 Init + 111293 111352 60 1 0 52 53 102 0.665 4.35 3.02 Term + 117033 117200 168 2 0 28 47 141 0.594 1.88 3.03 PlyA + 117260 117265 6 1.05 4.04 PlyA - 118276 118271 6 1.05 4.03 Term - 118584 118448 137 1 2 49 44 151 0.490 4.98 4.02 Intr - 136143 136027 117 1 0 62 92 38 0.122 2.04 4.01 Init - 136654 136603 52 1 1 63 110 34 0.189 4.44 4.00 Prom - 154649 154610 40 -1.96 5.00 Prom + 159829 159868 40 -4.26 5.01 Init + 165311 165455 145 1 1 79 80 211 0.648 19.54 5.02 Intr + 191818 191997 180 2 0 147 101 228 0.999 29.84 5.03 Term + 194641 194672 32 2 2 142 42 24 0.827 1.22 5.04 PlyA + 195072 195077 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 34178 33804 375 2 0 92 48 741 0.846 64.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:37865264_38066469|GENSCAN_predicted_peptide_1|1290_aa MSLMVSAGRGLGAVWSPTHVQVTVLQARGLRAKGPGGTSDAYAVIQVGKEKYATSVSERS LGAPVWREEATFELPSLLSSGPAAAATLQLTVLHRALLGLDKFLGRAEVDLRDLHRDQGR RKTQNSHGKKYGSLCAPWLVPGYLVFGFRFACEIKQQEQLVTFIFALPNGVGSFQSKSHF RTQGLDLRIRWYKLKSKPGKKDKERGEIEVDIQFMRNNMTASMFDLSMKDKSRNPFGKLK DKIKGKNKDSGSDTASAIIPSTTPSVDSDDESVVKDKKKKSKIKTLLSKSNLQKTPLSQS MSVLPTSKPEKVLLRPGDFQSQWDEDDNEDESSSASDVMSHKRTASTDLKQLNQVNFTLP KKEGLSFLGGLRSKNDVLSRSNVCINGNHVYLEQPEAKGEIKDSSPSSSPSPKGFRKKHL FSSTENLAAGSWKEPAEGGGLSSDRQLSESSTKDSLKSMTLPSYRPAPLVSGDLRENMAP ANSEATKEAKESKKPESRRSSLLSLMTGKKDVAKGSEGENPLTVPGREKEGMLMGVKPGE DASGPAEDLVRRSEKDTAAVVSRQGSSLNLFEDVQITEPEAEPESKSEPRPPISSPRAPQ TRAVKPRSGQASVPSELGHGADTQSSESPSVFSSLSSPIAAPISTSTPIESWPLVDRGQA KSEGPPLLPKAELQTESLTPVPNSGSSALGSLFKQPSFPANKGTEDSLMGRTRETGTEKN TSSLELEESLPEQPETGRQEEELPRFPCKKQDYSPSSGEAQEVPFALSLSSDGAVSPVGE LAAGGDRDLESQAGSLVESKARDAAEEVAPPLPMGASVPSIDSMMRKLEEMGLNLRKDQK KTKKRVSFSEQLFTEEAVAGAALLVEGHSSCPQELNPAWSVAGNASDGEPPESPHAEDSE RESVTTPGPATCGAPASPADHLLLPSQEESFSEVPMSEASSAKDTPLFRMEGEDALVTQY QSKASDHEGLLSDPLSDLQLVSDFKSPIMADLNLSLPSIPEVASDDERIDQVEDDGDQVE DDGETAKSSTLDIGALSLGLVVPCPERGKGPSGEADRLVLGEGLCDFRLQAPQASVTAPS EQTTEFGIHKPHLGKSSSLDKQLPGPSGGEEEKPMGNGSPSPPPGTSLDNPVPSPSPSEI FPVTHSFPSSAHSDTHHTSTAESQKKATAEGSAGRVENFGKRKPLLQAWVSPSETHPVSA QPGAGTGSAKHSGQLLSFAMTGGSKDPFPLLFRLHPVKPMNAMATKVANCSLGTATIISE NLNNEVMMKNLKDLEYCLAANPWSSTHYLD >gi568815590r:37865264_38066469|GENSCAN_predicted_CDS_1|3873_bp atgtccctaatggtctcggctggccggggcctgggggccgtgtggtccccaacccacgtg caggtgacggtgctgcaggcgcggggcctgcgggccaagggccccgggggcacgagcgac gcgtacgcggtgatccaggtgggcaaggagaagtacgccacctccgtgtcggagcgcagc ctgggcgcgcccgtgtggcgcgaggaggccaccttcgagctgccatcgctgctgtcctcc ggacccgcggccgccgccaccctgcagctcaccgtgctgcaccgcgcgctgctcggcctc gacaagttcctgggccgcgccgaggtggacctgcgggatctgcaccgcgaccagggccgc aggaagacgcaaaactctcatggcaagaagtatgggagtttatgtgccccgtggctggtg cctggctacctggtgtttggctttaggtttgcttgtgaaattaagcagcaagaacagttg gtgacgtttatatttgcacttcctaatggtgttggaagtttccaaagcaaatcccacttt aggacccagggattggacctgagaatcaggtggtataagttgaaatccaaaccaggaaag aaggacaaggagcgaggagaaattgaggttgacatccagtttatgagaaacaacatgact gccagcatgtttgacctttctatgaaagacaagtctcggaatccatttggaaagctgaag gacaagatcaaggggaagaataaggacagtgggtcagacaccgcctccgccatcatccct agcacgacaccttcggtcgacagtgatgatgagtctgtggttaaagacaagaaaaagaaa tcaaagatcaagaccttactttccaagtcaaatttgcagaagacgcctctttcccagtcc atgtctgtcctgccgacttcaaagccagaaaaagtgctgcttcgtcccggagactttcag tcccagtgggatgaagatgacaatgaggatgagtcctcctcggcctcggatgtcatgtct cacaagagaacagcgagtacggatcttaagcaactgaaccaggtcaactttacccttccc aagaaggaaggactttcctttcttggtggccttcggtctaagaatgatgtcctttcccgc tctaatgtctgcatcaatgggaaccatgtttacctggagcagccagaagccaagggtgag atcaaggatagcagcccgtcctcctccccatcccccaagggcttcagaaagaagcatttg ttctcttctacagagaacctggcggctgggtcttggaaggagcctgctgaaggaggtggg ctgtcttctgacaggcagctctccgaatcttccaccaaggactccttgaagtctatgacc ctgccgtcctaccgacctgccccactggtcagtggggacctcagggaaaacatggccccc gcaaactcagaggccacaaaagaagctaaggagagcaagaagccagagagcaggaggtcc tctttgctgtctctgatgacggggaagaaggatgtggctaagggcagtgaaggtgaaaac cctctcacggtcccagggagggagaaggaaggcatgctgatgggggttaagccgggggag gacgcatcggggcctgctgaagaccttgtgagaagatctgagaaagatactgcagctgtt gtctccagacagggcagctccctgaacctctttgaagatgtgcagatcacagaaccagaa gctgagccagagtccaagtctgaaccgagacctccaatttcctctccgagggctccccag accagagctgtcaagccccgctctggccaggcatctgtcccctctgaattgggacatggt gcagacacacagtcctctgagagtccttctgtcttctcctctctctcatctcccatagca gctcccatttccacatccactccaattgaaagctggcctctcgtagacaggggccaggcc aagtctgaaggaccacccttgctccctaaggcagagttgcaaactgagagtttaacaccg gttccaaattctggttcttctgccctgggatcacttttcaaacagccatcctttccagca aataaagggacagaagattctctgatgggcaggacccgtgagacaggcacagagaagaac accagcagccttgagttggaggagtctctcccagagcagcctgaaacagggcgacaagag gaagaacttccgagattcccctgcaaaaaacaagactacagcccatcatctggagaagcc caggaagtaccgtttgccctttcactcagcagtgatggagctgtgagccctgttggggag cttgcagcaggaggagacagagacttggagagtcaggctgggtctcttgtggagagcaaa gccagggatgcagctgaagaagtggcgccccctcttcccatgggagcatcagtcccttcc attgattccatgatgcggaagctggaagagatgggtctgaacctccgcaaggaccagaag aaaaccaagaagcgtgtgtcattttctgagcagctcttcacggaagaggcagtggcgggg gctgccttgctggtggaaggacacagcagttgtccccaggagctgaaccctgcatggtct gttgctggaaacgcgtctgacggagagcctcctgagtctccccacgcagaggactcagaa agggaatcggtgaccacacctgggccagcgacgtgtggtgcgccagcctccccagcggat cacctcctcctcccctcccaggaggagagtttctccgaagtccccatgagtgaagcaagc tcagcgaaagacactccactctttaggatggagggagaggatgcccttgtgactcagtat cagagcaaagccagtgaccacgaaggtttattgtctgaccccttgagtgaccttcagttg gtctcagattttaaatctccaatcatggccgatctgaacttaagccttccttccattcct gaagtcgcatcggatgatgaaagaatagatcaggttgaagatgacggagatcaggttgaa gatgatggagagacagcaaagtcgtcaactctggacataggagctttgtccttgggcttg gtagtcccctgtcctgagaggggaaaggggcccagtggcgaggcagataggttggtactg ggggagggcctgtgtgatttcaggctgcaagcaccccaggcatctgtgacagctccttca gagcagaccacagagttcggaattcacaaaccacatcttggcaagagctcaagcttggat aaacagctgccaggccccagtggtggtgaggaagaaaaaccgatgggaaatgggagtcca agcccgcctcctggcacatccctggacaatcctgtacccagcccctccccttctgagatc tttcctgtcacacactctttccccagctctgcacattctgacactcaccacaccagcaca gcagaatctcaaaaaaaagccacagcagagggctccgctggtagagttgaaaattttggc aagaggaagccactcctccaggcctgggtctcaccctcggagacacatccagtctcagct cagccaggcgctggaactgggtcagccaagcacagtgggcagttgctcagttttgccatg accggagggtctaaagatccttttccccttctctttagacttcatcctgtgaagccaatg aatgcaatggccaccaaggttgctaactgcagcttgggaactgccaccatcatcagtgag aacttgaacaatgaggtcatgatgaagaatctgaaggatttagaatactgccttgctgca aatccctggtcttcaactcattatctggactag >gi568815590r:37865264_38066469|GENSCAN_predicted_peptide_2|816_aa MAPWPHENSSLAPWPDLPTLAPNTANTSGLPGVPWEAALAGALLALAVLATVGGNLLVIV AIAWTPRLQTMTNVFVTSLAAADLVMGLLVVPPAATLALTGHWPLGATGCELWTSVDVLC VTASIETLCALAVDRYLAVTNPLRYGALVTKRCARTAVVLVWVVSAAVSFAPIMSQWWRV GADAEAQRCHSNPRCCAFASNMPYVLLSSSVSFYLPLLVMLFVYARVFVVATRQLRLLRG ELGRFPPEESPPAPSRSLAPAPVGTCAPPEGVPACGRRPARLLPLREHRALCTLGLIMGT FTLCWLPFFLANVLRALGGPSLVPGPAFLALNWLGYANSAFNPLIYCRSPDFRSAFRRLL CRCGRRLPPEPCAAARPALFPSGVPAARSSPAQPRLCQRLDGPGSDLKAGLVPPHPPPKP SASVLGFLKGLTAVEVRIHFRYEVQLLLALYTTGGHGPIKTHLAATHRGSSYTEAIPFGL KTRLQISQDPSLNYEYLPTMGLKSFIQASLALLFGKHSQAIVENRVGGVHTVGDSGAFQL GVQFLRAWHKDARIVYIISSQKELHGLVFQDMGFTVYEYSVWDPKKLCMDPDILLNVVES KQIFPFFDIPCQGLYTSDLEEDTRILQYFVSQGFEFFCSQSLSKNFGIYDEGVGMLVVVA VNNQQLLCVLSQLEGLAQALWLNPPNTGARVITSILCNPALLGEWKQSLKEVVENIMLTK EKVKEKLQLLGTPGSWGHITEQSGTHGYLGLNSQQVEYLVRKKHIYIPKNGQINFSCINA NNINYITEGINEAVLLTESSEMCLPKEKKTLIGIKL >gi568815590r:37865264_38066469|GENSCAN_predicted_CDS_2|2451_bp atggctccgtggcctcacgagaacagctctcttgccccatggccggacctccccaccctg gcgcccaataccgccaacaccagtgggctgccaggggttccgtgggaggcggccctagcc ggggccctgctggcgctggcggtgctggccaccgtgggaggcaacctgctggtcatcgtg gccatcgcctggactccgagactccagaccatgaccaacgtgttcgtgacttcgctggcc gcagccgacctggtgatgggactcctggtggtgccgccggcggccaccttggcgctgact ggccactggccgttgggcgccactggctgcgagctgtggacctcggtggacgtgctgtgt gtgaccgccagcatcgaaaccctgtgcgccctggccgtggaccgctacctggctgtgacc aacccgctgcgttacggcgcactggtcaccaagcgctgcgcccggacagctgtggtcctg gtgtgggtcgtgtcggccgcggtgtcgtttgcgcccatcatgagccagtggtggcgcgta ggggccgacgccgaggcgcagcgctgccactccaacccgcgctgctgtgccttcgcctcc aacatgccctacgtgctgctgtcctcctccgtctccttctaccttcctcttctcgtgatg ctcttcgtctacgcgcgggttttcgtggtggctacgcgccagctgcgcttgctgcgcggg gagctgggccgctttccgcccgaggagtctccgccggcgccgtcgcgctctctggccccg gccccggtggggacgtgcgctccgcccgaaggggtgcccgcctgcggccggcggcccgcg cgcctcctgcctctccgggaacaccgggccctgtgcaccttgggtctcatcatgggcacc ttcactctctgctggttgcccttctttctggccaacgtgctgcgcgccctggggggcccc tctctagtcccgggcccggctttccttgccctgaactggctaggttatgccaattctgcc ttcaacccgctcatctactgccgcagcccggactttcgcagcgccttccgccgtcttctg tgccgctgcggccgtcgcctgcctccggagccctgcgccgccgcccgcccggccctcttc ccctcgggcgttcctgcggcccggagcagcccagcgcagcccaggctttgccaacggctc gacggacctggctcggacttgaaggcagggctagtgcccccccacccgccccccaagccc tcggcctcagttctgggttttctcaaaggtttgacagctgtggaggtgagaatccacttc cggtatgaagtacagttactgttggctctgtacaccactggaggacatgggcccatcaaa acacacctggctgcaacacacagaggcagtagctacacagaagccataccttttggcctg aagactcgactacagatttcacaggatccctccctgaattatgagtacttgcccaccatg ggcctgaaatcattcatccaggcctctctagcactcctctttggaaagcacagccaagcc attgtggagaacagggtagggggtgtacacactgttggtgacagtggtgccttccagctt ggcgtccagtttctcagagcttggcataaggatgctcgtatagtttacatcatctcttct caaaaagaactgcatggactcgtcttccaggacatgggctttacagtttatgaatactct gtctgggaccccaagaagctatgcatggaccccgacatactcctcaatgtggtggagagc aagcagatattcccattttttgatattccctgtcaaggtttatacaccagtgacttggaa gaagatactagaatcttacaatactttgtgtctcaaggctttgagttcttctgcagccag tctctgtccaagaattttggcatttatgatgaaggagtggggatgctagtggtggtggca gtcaacaaccagcagctgctgtgtgtcctctcccagctggaaggattagcccaggccctg tggctaaacccccccaacacgggtgcacgtgtcatcacctccatcctctgcaaccctgct ctgctgggagaatggaagcagagtctaaaagaagttgtagagaacatcatgctaaccaag gaaaaagtgaaggagaaactccagctcctgggaacccctgggtcctggggtcacatcacc gagcagagtgggacccacggctatcttggactcaactcccagcaggtggaatacctggtc aggaagaagcacatctatatccccaagaacggtcagattaacttcagctgtatcaatgcc aacaacataaattacatcactgagggcatcaatgaggctgtcctcctcacagagagctca gagatgtgtcttccaaaggaaaaaaaaacactgattggaataaaactttag >gi568815590r:37865264_38066469|GENSCAN_predicted_peptide_3|75_aa MKRQTPWLQKAPTLVQKRNQLTIKQPEAGPSGGIPEEGIAITGDGSSMRVIAPKDLPVGQ DVEVEDSDIEDPDPG >gi568815590r:37865264_38066469|GENSCAN_predicted_CDS_3|228_bp atgaagcgccagaccccatggctgcagaaagctccaaccctggtgcagaagcggaaccag ttaactataaaacagcctgaggcaggtccttcaggaggtattccagaagaaggcattgct atcacaggagacggtagctccatgcgtgttatcgctcctaaagaccttccagtaggacaa gatgtggaggtggaagacagtgatattgaggatcctgaccctgggtag >gi568815590r:37865264_38066469|GENSCAN_predicted_peptide_4|101_aa METSWTITDALVTPTVEGFKGTKACYHSPATAWASKAYKLSLQLPFYLSKNRTSLTAEHS SALPLKVIYDVMRVAAINIAATDWAGPKISSTVPESMLAKD >gi568815590r:37865264_38066469|GENSCAN_predicted_CDS_4|306_bp atggaaacctcctggaccatcacagatgctttggtaactcctacagtggagggatttaag gggactaaagcctgttatcactcgcctgctacagcatgggcttctaaagcctataaactc tccttacaactcccattttacctgtccaaaaaccggacaagtcttacagctgagcattcc tccgcactaccgttaaaagtcatctatgatgtcatgagggtggcggccatcaacattgca gccacagactgggcaggccccaaaatctcttcaacggttccagagagtatgctggctaaa gattga >gi568815590r:37865264_38066469|GENSCAN_predicted_peptide_5|118_aa MSGGSSCSQTPSRAIPATRRVVLGDGVQLPPGDYSTTPGGTLFSTTPGGTRIIYDRKFLM ECRNSPVTKTPPRDLPTIPGVTSPSSDEPPMEASQSHLRNSPEDKRAGGEESQFEMDI >gi568815590r:37865264_38066469|GENSCAN_predicted_CDS_5|357_bp atgtccgggggcagcagctgcagccagaccccaagccgggccatccccgccactcgccgg gtggtgctcggcgacggcgtgcagctcccgcccggggactacagcacgacccccggcggc acgctcttcagcaccaccccgggaggtaccaggatcatctatgaccggaaattcctgatg gagtgtcggaactcacctgtgaccaaaacacccccaagggatctgcccaccattccgggg gtcaccagcccttccagtgatgagccccccatggaagccagccagagccacctgcgcaat agcccagaagataagcgggcgggcggtgaagagtcacagtttgagatggacatttaa