GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:25:55

Sequence gi568815590r:33448756_33658772 : 210017 bp : 45.18% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   4755   4461  295  0  1  126  103   123 0.846  14.18
 1.01 Init -  12697  12617   81  1  0   98   81    62 0.898   7.48
 1.00 Prom -  13562  13523   40                              -5.46

 2.00 Prom +  16051  16090   40                              -4.86
 2.01 Init +  36452  36783  332  1  2   98   23   100 0.332   0.61
 2.02 Intr +  39765  39874  110  0  2  107   88    82 0.996  10.03
 2.03 Intr +  39979  40043   65  2  2   88  102     0 0.996  -0.16
 2.04 Intr +  40233  40384  152  2  2  101   98    80 0.992   9.26
 2.05 Intr +  41530  41584   55  1  1   92   78    18 0.970   0.18
 2.06 Intr +  46787  46861   75  1  0   97   93    97 0.999  10.81
 2.07 Intr +  47870  47986  117  1  0   55  100   244 0.982  22.96
 2.08 Intr +  48477  48542   66  2  0   72   83    54 0.809   2.40
 2.09 Term +  49677  49874  198  2  0  107   34   344 0.994  28.30
 2.10 PlyA +  50859  50864    6                               1.05

 3.08 PlyA -  50919  50914    6                              -0.45
 3.07 Term -  50998  50978   21  1  0   99   49    16 0.792  -2.89
 3.06 Intr -  51735  51573  163  2  1   35  116   100 0.593   7.48
 3.05 Intr -  54817  54674  144  0  0   99   90    31 0.931   3.80
 3.04 Intr -  55180  54993  188  1  2  107   94   178 0.999  18.79
 3.03 Intr -  58566  58474   93  0  0   78   90    38 0.855   3.16
 3.02 Intr -  61177  60991  187  0  1   60   91    -3 0.717  -3.11
 3.01 Init -  63858  63212  647  0  2   66   78   254 0.753  17.00
 3.00 Prom -  77769  77730   40                              -3.96

 4.00 Prom +  85016  85055   40                              -2.16
 4.01 Sngl +  91301  91642  342  1  0   60   54   252 0.705  15.27
 4.02 PlyA +  92103  92108    6                               1.05

 5.07 PlyA -  92697  92692    6                               1.05
 5.06 Term - 100112  99998  115  1  1   64   48   116 0.629   3.24
 5.05 Intr - 100737 100655   83  0  2  117   70    18 0.457   1.34
 5.04 Intr - 102330 102289   42  0  0  126   98    29 0.744   6.14
 5.03 Intr - 102634 102589   46  0  1  101   88    72 0.998   6.91
 5.02 Intr - 110016 109860  157  1  1  148   96   134 0.996  19.27
 5.01 Init - 125507 125342  166  2  1   31   60   192 0.588   8.93
 5.00 Prom - 140170 140131   40                              -2.46

 6.11 PlyA - 142595 142590    6                               1.05
 6.10 Term - 143457 143258  200  1  2  112   55   281 0.986  24.76
 6.09 Intr - 144992 144778  215  1  2   81   75   373 0.361  33.66
 6.08 Intr - 149018 148540  479  2  2   84   47   272 0.067  14.55
 6.07 Intr - 151010 150910  101  0  2   65   65    51 0.392   0.33
 6.06 Intr - 168923 168791  133  2  1   74   76    58 0.041   3.42
 6.05 Intr - 183156 183012  145  2  1   69    6   112 0.012   1.38
 6.04 Intr - 187579 187443  137  1  2   69   73    54 0.085   1.37
 6.03 Intr - 191414 191340   75  2  0  104   89    30 0.363   4.41
 6.02 Intr - 198551 198480   72  2  0  104  105   -18 0.256   1.10
 6.01 Init - 206874 206797   78  0  0   81   88    43 0.412   4.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  83770  83657  114  1  0   83   38    97 0.953   2.77
S.002 Init -  84658  84596   63  1  0   45   82    81 0.832   2.35
S.003 Term - 149018 148509  510  2  0   84   43   278 0.808  17.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_1|126_aa
MVRIQRRKLLASCLCVTATVFLLVTLQVMVELGKFERKEFKSSSLQDGHTKMEEAPTHLN
SFLKKEGLTFNRKRKWELDSYPIMLWWSPLTGETGRLGQCGADACFFTINRTYLHHHMTK
AFLFYX

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_1|378_bp
atggtgcggattcagaggaggaagcttttggcatcttgcctgtgcgtcacagccaccgtc
tttctgcttgtcacactccaggtcatggttgagctggggaagtttgaaaggaaggagttt
aaaagttccagtttgcaagatggacatacaaaaatggaggaagcacctacgcatcttaat
tcatttcttaagaaagaaggattgaccttcaacaggaaaagaaaatgggaattggacagc
taccccattatgctctggtggtccccgctgacgggggagactgggaggttaggccaatgt
ggagcagatgcttgtttcttcaccatcaaccggacctacctccatcatcacatgaccaaa
gcattcctcttctatgnn

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_2|389_aa
MQSDDVSLLRLFLHPGFAQGILPIFGHLENAYAAGLLPRAALDVRLPEPRFSNIGSHACL
AGVYCPLPRPGCEHRTPSPQPMGSMCPVLRAYQCRTSRHAVSCQLSAPDLGTKTQSFCRN
EYSLTGLCNRSSCPLANSQYATIKEEKGQCYLYMKVIERAAFPRRLWERVRLSKNYEKAL
EQIDENLIYWPRFIRHKCKQRFTKITQYLIRIRKLTLKRQRKLVPLSKKVERREKRREEK
ALIAAQLDNAIEKELLERLKQDTYGDIYNFPIHAFDKALEQQEAESDSSDTEEKDDDDDD
EEDVGKREFVEDGEVDESDISDFEDMDKLDASSDEDQDGKSSSEEEEEKALSAKHKGKMP
LRGPLQRKRAYVEIEYEQETEPVAKAKTT

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_2|1170_bp
atgcagtcggatgatgtgagtctcctccggttgttcttacacccggggtttgcgcaggga
attttgcccattttcggacacttagaaaacgcgtacgcagcgggtctgttgcctcgtgcc
gctctggatgtgaggcttccggaaccccgattttccaacataggttctcacgcgtgtcta
gcaggggtttactgcccgctgccccggccgggttgtgagcacaggactccgtcacctcag
cccatggggtcgatgtgtcccgtccttcgtgcctaccaatgcaggacgtcgcgtcatgct
gtcagctgtcagctgtcagctccggacttgggaaccaagactcagagcttctgccgaaat
gaatatagcctgactggactgtgtaatcggtcatcctgtcccctggcaaatagtcagtat
gccactattaaagaagagaaaggacagtgctacttgtatatgaaggttatagaacgagcg
gcttttcctcggcgtctctgggaacgggtccggcttagtaaaaactatgagaaagcactg
gagcaaatagatgaaaatctgatttactggccccgtttcattcgacacaaatgtaagcag
agattcaccaagatcacccaatacctaattcgaattagaaaacttacactaaagcgacag
aggaaacttgttcctttgagtaagaaggtggagcgtagggagaaaagaagagaggaaaag
gcattaatagctgctcagctggacaatgccattgagaaggaattactggagagactgaaa
caagatacgtatggcgacatctacaacttccccattcatgccttcgacaaagccctggaa
caacaggaggcagagagtgactcttcagatactgaggaaaaagatgatgatgatgatgat
gaggaagatgtggggaaaagagaatttgtcgaagatggtgaggtagatgagagtgacata
agtgattttgaggatatggataaactggatgccagcagtgatgaagatcaggatggtaaa
tcctccagtgaggaggaggaagaaaaggcccttagtgcgaaacacaaaggcaaaatgccc
ttgagaggaccactgcagagaaaacgagcctatgtggaaatagaatacgagcaggagaca
gagcccgtggccaaagccaaaaccacgtga

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_3|480_aa
MELDSALEAPSQEDSNLSEELSHSAFGQAFSKILHCLARPEARRGNVKDAVLKDLGDLIE
ATEFDRLFEGTGARLRGMPETLGQVAKALEKYAAPSKEEEGGGDGHSEAAEKAAQVGLLF
LKLLGKVETAKNSLVGPAWQTGLHHLAGPVYIFAITHSLEQPWTTPRSREVAREVLTSLL
QVTECGSVAGFLHGENEDEKGRLSVILGLLKPDLYKESWKNNPAIKHVFSWTLQQVTRPW
LSQHLERVLPASLVISDDYQTENKILGVHCLHHIVLNVPAADLLQYNRAQVLYHAISNHL
YTPEHHLIQAVLLCLLDLFPILEKTLHWKGDGARPTTHCDEVLRLILTHMEPEHRLLLRR
TYARNLPAFVNRLGILTVRHLKRLERVIIGYLEVYDGPEEEARLKILETLKLLMQHTWPR
VSCRLVVLLKALLKLICDVARDPNLTPESVKSALLQEATDCLILLDRCSQGRVKKSLVED

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_3|1443_bp
atggagcttgacagcgctctggaagccccatcgcaggaagactctaatttgtccgaggag
ttgtctcactccgcctttggacaggccttctccaagattttacactgtcttgcccgcccg
gaggcacgacgaggcaatgtaaaagatgcagttcttaaagacctcggtgatctaatagaa
gccacagaatttgataggttatttgaggggactggtgcacggctccgcggaatgccggag
acactggggcaggtagcaaaagccctggagaagtatgcagccccctccaaggaggaggaa
ggtggaggtgatgggcactccgaagcggccgagaaagcagcccaagttgggttactgttt
cttaaactgttagggaaagttgagactgctaagaattccctggtcggccctgcatggcag
acgggcctgcatcacttggcaggacccgtttatatttttgccatcacacacagcttggag
caaccatggaccactccgagatctcgggaagttgctagggaggtgctcacctcactgctt
caagttactgaatgcggttctgtggcaggattcctacatggagaaaatgaagatgagaaa
gggagactttcggtgatactagggcttctcaaacccgacttgtataaggaatcctggaag
aataaccctgccatcaaacatgttttctcatggactctgcaacaggtcactcggccctgg
ctgagccagcatctggaaagggtacttcccgcatcattggtcatttcagatgactatcag
actgagaacaaaatcctgggtgtacactgtctccatcacattgtgcttaatgtgccagct
gctgatttgctccagtataacagagcccaggtcctataccatgccatttccaaccacctg
tacacaccagagcaccacctcattcaggctgtgctcctgtgtctgctggatttattcccc
atcctggagaaaaccctgcactggaaaggagatggagctcgacccaccacccattgtgat
gaggtcctgcggctgatcctgacccacatggagccagagcaccgccttcttttacgcagg
acctacgcaagaaacctgccggctttcgtgaacaggttggggatcctaactgtccggcac
ttaaagaggctggagagagtcatcattggttatctggaggtttatgatggacctgaggag
gaagctagactgaagatattggaaaccctaaaacttctcatgcaacatacttggcccaga
gtttcctgcagacttgtggtcttactgaaggccctcttgaaactgatttgtgatgtagca
agggatccaaaccttacacctgagtctgttaagagcgccctgctacaggaggccacagac
tgcctgattctcctggaccgctgttctcaaggacgggtaaagaaatctttagttgaagat
tga

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_4|113_aa
MAANSSFLGGVHGLFLVWVALRVLGDRPFKCTFMSLTLHYPRCRLETGIQGAFGKPQGTV
ARVHIGQVKSICTKLQNKEHVIEAPCRAKFKFPGHQKIHISKKWGFTKFNVDE

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_4|342_bp
atggctgcaaacagcagctttcttggtggtgtacatggcctgtttcttgtatgggttgct
ctaagggtccttggagacaggcctttcaaatgtacgttcatgtctctgaccttgcactac
ccccgatgtaggctcgaaacaggcattcaaggtgcctttggaaagccccagggcactgtg
gccagggttcacattggccaagttaagtccatctgcaccaagctgcagaacaaggagcat
gtgattgaggcgccatgcagggccaagttcaagttccctggccaccagaagatccacatc
tccaagaagtggggctttaccaagttcaatgtggatgaatga

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_5|202_aa
MGLRAVEQRVALVGEARAAQEPMEWVGGSGMAGCRSRALPCGKAAKARREIERSAGCFCG
LGLVSTNKSCSMPPISFQDLPLNIYMVIFGTGIFVFMLSLIFCCYFISKLRNQAQSERYG
YKEVVLKGDAKKLQLYGQTCAVCLEDFKGKDELGVLPCQHAFHRKCLVKWLEVRCVCPMC
NKPIASPSEATQNIGILLDELV

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_5|609_bp
atgggactgcgcgccgtggagcagagggtggcgctcgtcggggaagctcgggccgcacag
gagcccatggagtgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccc
tgtgggaaggcagccaaggcccggcgagaaatcgagcgcagcgccgggtgtttctgtggc
ctgggactggttagcaccaacaagtcctgctcgatgccacccatcagtttccaggacctt
ccgctcaacatctatatggtcatcttcggcacaggcatctttgtcttcatgctcagcctt
atcttctgctgctattttatcagcaaactgcggaaccaggcacagagtgagcgatacgga
tataaggaggtggtgcttaaaggtgatgccaagaagttacaattatatgggcagacctgc
gcagtctgtctggaagacttcaaggggaaggatgagttaggcgtgctcccgtgccaacac
gcctttcaccgcaagtgtctggtgaaatggctggaagttcgctgtgtctgccccatgtgt
aacaagcccattgctagtccctcagaggccacgcagaacattgggattctattggatgag
ctggtgtga

>gi568815590r:33448756_33658772|GENSCAN_predicted_peptide_6|544_aa
MEYYTAIKKNEILSFATTWMELEVIFGSPILWPQTSTSPWPVRNWAAQQEASAFLLSGIT
AAELEVEIVPGPSSSTLSQKSFPLENFPCQLHAAKTELTTGPSGYPGTNYPSGGITQALK
MEAAKAQGEIECSTGGPAQVEDPAHPPQLLAWVLSPLEPAASGAGRLLRAQTPETFLWTW
TSSLSQCQASPHRLRLKAHFGAKSASVDSQKASETAAATAAGASWPWRCLRRRSAFRTEA
QRSACKESQRAEQKELLQLEEKATPEGSCQSQAYLLSLENLLPKDQPREGKQLLWTHSSV
SHTPSSSSSSYLATEPIITTATPAAATVSATSKMCPGNWLWASMTFMARFSRSSSRSPVR
TRGTLEEMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLYLGDQDMANNRRELRRLG
ITHVLNASHSRWRGTPEAYEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGK
ILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGIIPNRGFLRQLLALDRRLRQ
GLEA

>gi568815590r:33448756_33658772|GENSCAN_predicted_CDS_6|1635_bp
atggagtactatacagccattaaaaagaatgagattctgtcatttgccacaacatggatg
gaactggaagtcattttcgggtccccaatcctctggccacagaccagtaccagtccatgg
ccagttaggaactgggctgcacagcaggaggcttctgcctttttgctctcaggaattacc
gctgctgaactggaagtagagattgtccctggcccttcctcatcaacattatctcagaag
tccttccctcttgaaaactttccatgccagcttcatgctgccaagactgaattaactacc
ggaccttcaggataccctgggaccaactatccctctgggggcattacccaggctcttaaa
atggaggcagctaaggcccagggagaaattgagtgcagcaccggtgggccagcacaggtg
gaagacccggcacaccctccgcagctgctggcctgggtgctaagcccgttagagcccgcg
gccagcggggctggtcggctgctccgagctcagactccagaaacattcttatggacttgg
acatcaagcctatcccagtgccaggccagccctcacagactaaggctcaaggcccacttt
ggtgccaagtcagcttctgtggactcacagaaggcttcagaaacagcggcggcgacagca
gcaggagcgtcatggccgtggcgctgtctgcgccggcgatccgcctttcggactgaggcc
cagcgcagcgcttgcaaagagagccaaagagctgagcaaaaggagctgctgcagcttgaa
gaaaaagccacccctgaaggctcctgccagagccaggcttaccttctctccctcgagaat
cttctcccgaaggaccagcccagggaagggaagcagctgctctggactcactcaagtgtg
tctcacaccccttcctcttccagcagcagctacctggcaactgaacccatcatcaccaca
gccactcctgcagctgccacggtttctgccacctctaagatgtgccctggtaactggctt
tgggcttctatgacttttatggcccgcttctcccggagtagctcaaggtctcctgttcga
actcgagggaccctggaggagatgccaaccgttcaacatcctttcctcaatgtcttcgag
ttggagcggctcctctacacaggcaagacagcctgtaaccatgccgacgaggtctggcca
ggcctctatctcggagaccaggacatggctaacaaccgccgggagcttcgccgcctgggc
atcacgcacgtcctcaatgcctcacacagccggtggcgaggcacgcccgaggcctatgag
gggctgggcatccgctacctgggtgttgaggcccacgactcgccagcctttgacatgagc
atccacttccagacggctgccgacttcatccaccgggcgctgagccagccaggagggaag
atcctggtgcattgtgctgtgggcgtgagccgatccgccaccctggtactggcctacctc
atgctgtaccaccaccttaccctcgtggaggccatcaagaaagtcaaagaccaccgaggc
atcatccccaaccggggcttcctgaggcagctcctggccctggaccgcaggctgcggcag
ggtctggaagcatga